ISSN 0016-2523 


FUJITSU 


SCIENTIFIC & TECHNICAL 
JOURNAL 


Winter 1991 Vol. 27, No. 4


FUJITSU Sci. Tech. J., 27, 4, pp. 299-378, Kawasaki. Winter, 1991 


The Issue’s Cover : 

Phase by Hirofumi OHKUMA
When supercooled water changes from the liquid 
to solid phase, the change is rapid. 
One molecule sets off a chain reaction that results 
in an expansion due to the supercooling energy. 


FUJITSU Scientific & Technical Journal is published quarterly by FUJITSU 
LIMITED of Japan to report the results of research conducted by FUJITSU 
LIMITED, FUJITSU LABORATORIES LTD., and their associated companies in 
communications, electronics, and related fields. It is the publisher's intent that 
FSTJ will promote the international exchange of such information, and we 
encourage the distribution of FSTJ on an exchange basis. All correspondence 
concerning the exchange of periodicals should be addressed to the editor. 

FSTJ can be purchased through KINOKUNIYA COMPANY LTD., 38-1, 
Sakuragaoka 5-Chome, Setagaya-ku, Tokyo 156, Japan, ( Telephone 
+81-3-3439-0162, Facsimile : +81-3-3706-7479 ). 

The price is US$7.00 per copy, excluding postage. 

FUJITSU LIMITED reserves all rights concerning the republication and pub- 
lication after translation into other languages of articles appearing herein. 

Permission to publish these articles may be obtained by contacting the 
editor. 


FUJITSU LIMITED Tadashi Sekizawa, President 
FUJITSU LABORATORIES LTD. Mikio Ohtsuki, President 


Editorial Board 

Editor Shigeru Sato 
Associated Editor Hideo Takahashi 
Editorial Representatives 


Sadao Fujii Tetsuya Ilsayama Ken-ichi Itoh 
Yoshihiko Kaiju Yoshimasa Miura Makoto Mukai 
Yasushi Nakajima Hiroshi Nishi Hajime Nonogaki 
Seiya Ogawa Shinji Ohkawa Shinya Okuda 
Hirofumi Okuyama Teruo Sakurai Tohru Sato 

Yoshio Tago Shozo Taguchi Hirobumi Takanashi 
Mitsuhiko Toda Toru Tsuda Takao Uehara 


Akira Yoshida 


Editorial Coordinators 
Yukichi Iwasaki
Kazuo Yono 


FUJITSU LIMITED 1015 Kamikodanaka, Nakahara-ku, 
Kawasaki 211, Japan 

Cable Address : FUJITSULIMITED KAWASAKI 

Telephone : +81-44-777-1111 

Facsimile : +81-44-754-3562


Printed by MIZUNO PRITECH Co., Ltd. in Japan 
©1991 FUJITSU LIMITED (December 14, 1991)




CONTENTS 


299  Photoexcited Processes for Semiconductors I: Low Temperature Epitaxy
     ● Tatsuya Yamazaki ● Takashi Ito

317  Photoexcited Processes for Semiconductors II: Dry Cleaning and Dry Etching
     ● Yasuhisa Sato ● Rinshi Sugino ● Takashi Ito

329  Semiconductor Laser Light Sources and Receivers for Next Generation
     Optical Communication Systems
     ● Hiroshi Ishikawa

338  Report on Computer Algebra Research
     ● Taku Takeshima ● Masayuki Noro ● Kazuhiro Yokoyama

360  Boron/Magnesium Sintered Composites for Computer Peripherals
     ● Eiji Horikoshi ● Takehiko Sato

369  Moving-Fiber Optical Switch Using LiNbO₃ Piezoelectric Actuator
     ● Noboru Wakatsuki ● Hisashi Sawada ● Masanori Ueda


SYNOPSES 
UDC 621.3.049.771.14.002.2 
FUJITSU Sci. Tech. J., 27, 4, pp. 299-316 (1991)

Photoexcited Processes for Semiconductors I:
Low Temperature Epitaxy

● Tatsuya Yamazaki ● Takashi Ito


Photoenhanced processing promotes low-temperature growth of
Si epitaxial layers. Irradiation of the wafer surface with
high-power UV light excites both the vapor-phase reactions of the
disilane source gas and the surface reactions; this reduces the
crystal growth temperature to 500 °C and increases the growth
rate. The authors report that heavy boron doping up to
1 × 10²⁰ cm⁻³ produced superior crystal quality at 500 °C with
an abrupt impurity profile. Photoexcitation also allowed
low-temperature selective epitaxy without microfacets or crystal
defects.

A photoenhanced low-temperature epitaxial layer used in
high-speed bipolar transistors achieved a cutoff frequency of 43 GHz,
and a 1/8 static frequency divider using the transistors operated
up to 11.8 GHz.

A photoexcited dry cleaning process using highly purified
chlorine gas was developed as an ultraclean process for silicon 
ultra-large scale integrated-circuit (ULSI) production. The process 
aids in deep-submicron device fabrication. Metallic contaminants 
remaining on a wafer's surface after wet cleaning can be signifi- 
cantly reduced by photoexcited cleaning. Photoexcited cleaning 
can also remove contaminants that have penetrated the silicon 
substrate during reactive-ion processing by etching the contami- 
nated layer without damaging the uncontaminated layer. Thus, dry 
cleaning improves thin gate oxide integrity, SiO₂-Si interface
quality, carrier lifetime, junction leakage characteristics, and 
epitaxial film quality.

Semiconductor Laser Light Sources and Receivers for
Next Generation Optical Communication Systems

● Hiroshi Ishikawa


Laser light sources and receivers for the next generation of 
optical communication systems currently being developed in 
Fujitsu Laboratories Ltd. are reviewed. A modulator-integrated
DFB laser and an APD for high-speed systems, and a multielectrode
DFB laser and balanced receivers for coherent communication
systems, are described.


UDC 512.714:681.32 
FUJITSU Sci. Tech. J., 27, 4, pp. 338-359 (1991) 


Report on Computer Algebra Research

Computer algebra is a newly developing research area that bridges
constructive mathematics and scientific information processing
technology. Research in this area includes design, analysis, and
prototyping of mathematical algorithms for symbolic and algebraic
computation.

This paper describes an experimental system for computer 
algebra, named risa — Research Instrument for Symbolic Algebra. 
At present, the system forms the basic kernel of a computer 
algebra system, and is expected to become an essential engine for 
advanced future systems. As a general-purpose computer algebra 
system, risa’s operating speed is competitive and sometimes 
outstanding. This paper also briefly describes several mathematical 
results of some algebraic computations related mainly to poly-

A magnesium-based sintered composite, reinforced with boron
particles, has been developed that has an increased modulus of
elasticity and a controllable thermal expansion coefficient. A
6 vol% boron/magnesium-9 wt% aluminum composite has an excellent
modulus of elasticity, 1.3 times that of the ordinary magnesium
alloy, and a thermal expansion coefficient close to that of aluminum
alloy. This composite also has improved machinability as the size
of the boron particles is controlled. A head-arm component for a 
disk drive was experimentally produced using this composite, and 
it was found that the composite causes greatly decreased thermal 
off-track error in the disk drive.

Moving-Fiber Optical Switch Using
LiNbO₃ Piezoelectric Actuator

● Noboru Wakatsuki ● Hisashi Sawada ● Masanori Ueda


The need for a fast, miniaturized, optical switch with low 
insertion loss has increased with the proliferation of optical 
communications and optical signal processing. This paper describes 
a moving-fiber optical switch consisting of a LiNbO₃ piezoelectric
actuator having an anti-polarized ferroelectric laminar domain 
(inversion layer). A new construction technique simplifies optical 
axis alignment and makes fiber end-face polishing unnecessary. 
A 2 × 2 moving-fiber optical switch constructed using the above
techniques has an insertion loss within 0.8 dB and a switching 
time less than 10 ms. 


REVIEW 


UDC 621.3.049.771.14.002.2 


Photoexcited Processes for Semiconductors I:
Low Temperature Epitaxy

● Tatsuya Yamazaki ● Takashi Ito


(Manuscript received August 7, 1991) 


Photoenhanced processing promotes low-temperature growth of Si epitaxial layers.
Irradiation of the wafer surface with high-power UV light excites both the vapor-phase
reactions of the disilane source gas and the surface reactions; this reduces the crystal
growth temperature to 500 °C and increases the growth rate. The authors report that heavy
boron doping up to 1 × 10²⁰ cm⁻³ produced superior crystal quality at 500 °C with an
abrupt impurity profile. Photoexcitation also allowed low-temperature selective epitaxy
without microfacets or crystal defects.

A photoenhanced low-temperature epitaxial layer used in high-speed bipolar transistors
achieved a cutoff frequency of 43 GHz, and a 1/8 static frequency divider using the
transistors operated up to 11.8 GHz.


1. Introduction 

High-performance integrated circuits need to 
be made smaller, both laterally and vertically. 
To achieve submicron design and ultrashallow 
junction depth, device fabrication temperatures 
must be lowered. In particular, the Si epitaxial 
growth process has the highest process tempera- 
ture of all the device fabrication processes, and 
this must be lowered to form abrupt impurity 
profiles and to eliminate contamination. The 
Si epitaxial process temperature is usually above
1000 °C¹⁾, but this causes extensive impurity
diffusion, and hence the conventional epitaxial 
process cannot be used once the device impurity 
profiles have already been made. This process 
is therefore only used in the initial stages of 
making the device. The high temperatures also 
cause metal and oxygen contamination, and 
degrade crystal quality. 

Low-temperature epitaxial growth can be 
used in the intermediate and final stages of 
device processing to form the base and emitter 
regions of bipolar transistors, the source and
drain in MOSFETs, and for metallization.
There are many ways to make new device
structures, and low-temperature epitaxy has
great potential for improving the performance 
and packing density of LSI chips. 

Many methods of lowering the growth 
temperature have already been investigated. 
For example, instead of using SiCl₄ for the
source gas in conventional epitaxy, SiH₂Cl₂
or SiH₄ is used²⁾,³⁾. These source gases reduce
the growth temperature to 900 °C. Below
900 °C, however, crystal quality deteriorates
and poly-Si begins to grow instead of
single-crystal Si.

Molecular beam epitaxy (MBE) has also
been developed to lower growth temperatures⁴⁾⁻⁶⁾.
Using MBE, an Si epitaxial layer
can be grown under ultrahigh vacuum (UHV),
thus reducing the growth temperature to about
600 °C. However, the equipment required for
low-temperature MBE is very complex and
the unit cost is too high for device fabrication.
Furthermore, the quality of crystals grown using
MBE is not yet good enough for device fabrication.

Another method uses ultrahigh vacuum/
chemical vapor deposition (UHV/CVD)⁷⁾⁻⁹⁾.
This method successfully lowers the growth
temperature to 500 °C while still retaining
device-quality crystals, but the growth rate in
the low-temperature region is very small.

Other low-temperature epitaxial growth
methods using enhanced processes have also
been developed. Plasma-enhanced CVD epitaxy
lowers the growth temperature to about
750 °C¹⁰⁾⁻¹³⁾, but charged particles remaining
at the substrate surface cause serious damage
and crystal defects. Photoenhanced epitaxy,
however, does not cause such damage. There
are two types: photosensitized photolysis and
direct photolysis¹⁴⁾⁻¹⁷⁾. The former uses
mercury-sensitized photolysis to reduce the growth
temperature to 250 °C, but the epitaxial films
grown contain a large concentration of hydrogen
atoms, and the crystal quality is very poor. We
therefore used the latter type of photoenhanced
epitaxy, direct photolysis.

This paper describes photoepitaxial growth
characteristics at both atmospheric and
reduced pressure, including photoenhanced
low-temperature selective epitaxy under ultra- 
high vacuum. The paper also describes the 
details of heavy boron doping during photo- 
epitaxy and the application of this process to 
bipolar transistors. 


2. Photoepitaxy at atmospheric pressure 

This section describes photoenhanced epitaxy
at atmospheric pressure using conventional
epitaxy equipment with few modifications
(see Fig. 1)¹⁸⁾. An ultrahigh-pressure short-arc
mercury lamp was used as the UV source, and
the visible and infrared components of the
mercury light were attenuated by a cold mirror
(selective reflection mirror). The resulting light
had a wavelength range between 185 nm
and 260 nm (see Fig. 2). The UV light irradiated
the wafer surface vertically with a total
intensity of 60 mW/cm². Disilane (Si₂H₆) was
used as the source gas. Disilane absorbs UV
light at wavelengths shorter than 200 nm and
dissociates directly by photolysis to generate
radicals according to the following equation (see
Fig. 2)¹⁹⁾,²⁰⁾:


Fig. 1: Experimental setup for atmospheric-pressure
photoepitaxy (labeled parts: Hg-Xe lamp, cold mirror
(200-260 nm), Si₂H₆ + H₂, exhaust).

Fig. 2: UV spectrum from a short-arc mercury lamp
(light intensity in arbitrary units vs. wavelength,
150-350 nm). The absorption thresholds of silane and
disilane are indicated by arrows.


Si₂H₆ + hν (<210 nm) → SiH₂* + SiH₃* + H.   (1)
Photodissociated molecules and atoms are
supplied to the silicon substrate, which increases
the growth rate and improves crystal quality.
Conventional source gases such as silane (SiH₄)
and dichlorosilane (SiH₂Cl₂) do not absorb UV
light in this wavelength range, and so do not dissociate.
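
As a rough check on the photolysis threshold in Equation (1), the energy of a photon follows from E = hc/λ. The short sketch below converts the wavelengths mentioned in the text into photon energies. The function name and the choice of output units are illustrative assumptions and are not part of the original experiment.

```python
# Photon energy from wavelength, E = h*c/lambda (illustrative helper).
H_PLANCK = 6.62607015e-34      # J s (exact SI value)
C_LIGHT = 2.99792458e8         # m/s (exact SI value)
E_CHARGE = 1.602176634e-19     # J per eV (exact SI value)
KCAL_PER_MOL_PER_EV = 23.0605  # 1 eV per particle expressed in kcal/mol (approx.)


def photon_energy_ev(wavelength_nm: float) -> float:
    """Return the photon energy in eV for a wavelength given in nm."""
    if wavelength_nm <= 0:
        raise ValueError("wavelength must be positive")
    return H_PLANCK * C_LIGHT / (wavelength_nm * 1e-9) / E_CHARGE


if __name__ == "__main__":
    # Wavelengths quoted in the text: lamp range limits and absorption thresholds.
    for nm in (185, 200, 210, 260):
        ev = photon_energy_ev(nm)
        print(f"{nm} nm -> {ev:.2f} eV ({ev * KCAL_PER_MOL_PER_EV:.0f} kcal/mol)")
```

Shorter wavelengths carry more energy per photon, which is consistent with only the short-wavelength end of the lamp spectrum contributing to disilane photolysis.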

A four-inch Si(100)-oriented wafer was used as the substrate. After
wet RCA treatment, the substrate was loaded into a vertical epitaxial
reaction chamber equipped with an RF heater and an SiC-coated carbon
susceptor. Before Si deposition, the substrate was heated to 1050 °C
for 10 min in hydrogen without UV irradiation to remove any residual
oxide. After this precleaning, the substrate was cooled to the
deposition temperature and irradiated with UV light. This did not
cause the substrate surface temperature to increase significantly,
since the infrared wavelengths had already been attenuated. The
substrate temperature was measured with an optical pyrometer
precalibrated for the emissivity of silicon. During Si deposition, the
hydrogen flow rate was set at 20 l/min and the disilane flow rate at
1.5 cc/min.

Fig. 3: RHEED pattern from a photoepitaxially grown layer at 630 °C.
The photoepitaxial layer thickness is 200 nm.

Fig. 4: Growth temperature dependence of growth rate (growth rate in
µm/min vs. 10⁴/T in 1/K; UV: 25 kcal/mol, thermal: 39 kcal/mol).
The UV irradiation reduced the activation energy from 39 kcal/mol to
25 kcal/mol below 900 °C.

Fig. 5: UV intensity dependence of growth rate (growth rate in µm/min
vs. UV intensity in mW/cm²). UV irradiation had no effect on the
growth rate using silane.

We investigated the crystal quality of the
photoepitaxial layer grown at 630 °C using
reflection high-energy electron diffraction
(RHEED). Figure 3 clearly shows a streaked
pattern and fine Kikuchi lines, which indicates
that the crystal quality is good. In contrast, without
UV irradiation, polycrystalline Si was deposited
at 630 °C. The temperature dependence of the
Si growth rate under UV irradiation was also
investigated (see Fig. 4). For comparison, the epitaxial
growth rate using SiH₄ as the source gas is
also shown; its growth is not influenced by
light irradiation because SiH₄ gas does not absorb
radiation at these wavelengths.

Fig. 6: Impurity profile in a photoepitaxial layer grown at
830 °C on an n⁺ buried layer. The transition region
was 0.32 µm.

At temperatures higher than 900 °C, Si₂H₆
gas dissociated thermally, and gas-phase diffusion
of the reactive species, which has a small
temperature dependence, limited the growth rate. At
temperatures lower than 900 °C, surface reactions
determined the growth rate. Evidently,
UV irradiation greatly reduces the activation
energy for the Si₂H₆ source gas. We studied
how the UV intensity affected the growth
rate (see Fig. 5) and found that, while the rate
is constant at a substrate temperature of
1010 °C, it increases with light intensity at
770 °C and 830 °C;
UV irradiation has a greater effect at lower
substrate temperatures.
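
To illustrate what the two activation energies quoted in Fig. 4 imply, the sketch below evaluates how steeply an Arrhenius-type rate, proportional to exp(-Ea/RT), falls on cooling. Only the activation energies (39 and 25 kcal/mol) come from the text. The pre-exponential factors are not given, so the code compares relative changes only, and the function name and the two example temperatures are illustrative assumptions.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def arrhenius_drop(ea_kcal_per_mol: float, t_hot_c: float, t_cold_c: float) -> float:
    """Factor by which an Arrhenius rate falls on cooling from t_hot_c to t_cold_c.

    Assumes rate = A * exp(-Ea / (R * T)) with a temperature-independent A,
    so A cancels and only the activation energy matters.
    """
    t_hot = t_hot_c + 273.15
    t_cold = t_cold_c + 273.15
    if t_cold > t_hot:
        raise ValueError("t_cold_c must not exceed t_hot_c")
    return math.exp(ea_kcal_per_mol / R_KCAL * (1.0 / t_cold - 1.0 / t_hot))


if __name__ == "__main__":
    # Example temperatures chosen for illustration within the
    # surface-reaction-limited regime (below 900 degC).
    thermal = arrhenius_drop(39.0, 900.0, 700.0)
    with_uv = arrhenius_drop(25.0, 900.0, 700.0)
    print(f"thermal (39 kcal/mol): rate falls by a factor of {thermal:.1f}")
    print(f"with UV (25 kcal/mol): rate falls by a factor of {with_uv:.1f}")
```

Under these assumptions, the lower activation energy means the rate falls off more gently on cooling, which matches the observation that UV irradiation matters most at low substrate temperatures.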


A photoepitaxial layer 1.8 µm thick was
grown on an n⁺ buried layer formed by
arsenic-ion implantation. We employed a growth
temperature of 830 °C to grow a relatively thick
epitaxial layer (see Fig. 6). The transition
distance from the substrate interface to the point in the
epitaxial layer where the carrier concentration was
10¹⁶ cm⁻³ was only 0.32 µm. Owing to the low
temperature, autodoping of arsenic was
slight.

We have thus developed a low-temperature
photoepitaxy process in hydrogen at atmospheric
pressure using Si₂H₆ as the source gas. The
Si₂H₆ source dissociates directly under UV
irradiation, thereby increasing the growth rate.
The epitaxial growth temperature can be as low
as 630 °C and still produce an almost abrupt
impurity profile.


3. Photoepitaxy at reduced pressure 

At atmospheric pressure and temperatures
below 700 °C, single crystals could not be grown
quickly, so it was impractical to make thick epitaxial
layers this way. To improve the growth rate
of high-quality crystals below 700 °C, we
investigated photoepitaxy at reduced pressure.

We designed a special epitaxial chamber (see Fig. 7). The Si wafer
was mounted on an SiC-coated carbon susceptor in a synthetic quartz
chamber, and infrared halogen lamps heated the susceptor from behind.
The wafer surface was irradiated by UV light from high-pressure
long-arc mercury lamps with an intensity of 1.2 W/cm² (integrated
over wavelengths less than 300 nm). The lamps raised the surface
temperature, measured by a thermocouple, by about 30 °C at 600 °C.
The base pressure was 1 × 10⁻⁴ Pa, achieved using an oil-free dry
pumping system and a turbo-molecular pump. To prevent absorption of
the UV irradiation by oxygen, the area surrounding the quartz chamber
was filled with nitrogen. The same wafer specifications and
pre-loading wet cleaning were used as for atmospheric-pressure
photoepitaxy. To remove any native oxide, the wafer was heated to
900 °C in hydrogen at 67 kPa for 10 min in the quartz chamber, and
the temperature was then lowered to the deposition temperature.

Fig. 7: Experimental setup for reduced-pressure photoepitaxy.

The same disilane and hydrogen gas system was used for Si film
deposition as for atmospheric-pressure photoepitaxy. We investigated
the growth rate at temperatures below 800 °C. From 650 °C to 800 °C,
the growth rate decreased linearly with the inverse of temperature on
the Arrhenius plot, both with and without UV irradiation (see Fig. 8).
The growth rate with UV irradiation was about five times that without
UV irradiation, and UV irradiation also reduced the activation energy.
Below 650 °C, no film could be deposited without UV irradiation, yet
in the presence of UV irradiation, single-crystal Si could be grown
independently of temperature. We studied the dependence of growth rate
on UV intensity at total pressures of 27 kPa and 13 Pa, at a
temperature of 600 °C (see Fig. 9). At 13 Pa, the growth rate
increased with UV intensity, whereas at 27 kPa the growth rate was
almost constant for weak UV intensities and increased for strong UV
intensities. At weak UV intensities (in the thermal growth region),
the growth rate was faster at high total pressure and high disilane
partial pressure than at low total pressure and low disilane partial
pressure. Conversely, at strong UV intensities, the growth rate of
13-Pa epitaxy nearly exceeded that of 27-kPa epitaxy despite the low
disilane partial pressure.

Fig. 8: Temperature dependence of growth rate. The total pressure was
constant at 27 kPa.

Fig. 9: UV intensity dependence of growth rate. The growth temperature
was constant at 600 °C.

Fig. 10: FT-IR spectra of the vapor phase in the growth chamber with
or without UV irradiation (transmittance in arbitrary units vs. wave
number, 2400-2000 cm⁻¹).

To investigate the dependence of growth
rate on temperature and UV intensity, we analyzed
the vapor-phase photochemical reaction in situ
using Fourier-transform infrared
spectroscopy (FT-IR). The post-reaction gas was
brought from the photoepitaxial gas system into
an FT-IR analysis cell through a quartz capillary
positioned 1 mm above the wafer surface. The
pressure of the analysis cell was kept constant at
1.3 kPa. The IR absorption spectra were observed
for both irradiated and nonirradiated reaction
gas at room temperature (see Fig. 10). Without
UV irradiation, the absorption bands at a
wavenumber of 2215 cm⁻¹ and around 930 cm⁻¹,
which are due to the silane stretching and
SiH₃ deformation modes, could not be observed.
With UV irradiation, however, these absorption
bands could be seen clearly. Thus, in this
experimental system, silylene species were
generated photochemically from disilane according
to Equation (1). The reaction products, the
singlet silylene species SiH₂ and SiH₃, were
adsorbed on the Si surface, and the solid Si film
was then deposited by dehydrogenation.

We studied the temperature dependence of the concentrations of
disilane and silane using FT-IR (see Fig. 11). Above 700 °C, most of
the disilane dissociated thermally, and disilane could not be detected
in the FT-IR spectra even with UV irradiation. The silane
concentration decreased as the growth rate increased, since the Si
deposition used up the silylenes as they were produced. Below 650 °C
and without UV irradiation, the concentration of silylene species
decreased as the temperature dropped. With UV irradiation, however,
the concentration became almost constant owing to the photochemical
dissociation of disilane. The temperature dependence of the
concentration of the silylene species matched that of the growth rate
shown in Fig. 8. We conclude that the growth rate is determined by the
concentration of the silylene species, which are generated by thermal
reactions in the high-temperature region and by photochemical
reactions in the low-temperature region.

Fig. 11: Disilane and silane concentrations (ppm) in the vapor phase
in the growth chamber, with and without UV irradiation. The
concentrations were calculated from FT-IR data.

Fig. 12: Raman spectra of the Si substrate (FWHM: 3.2 cm⁻¹) and a
photoepitaxial layer grown at 540 °C (FWHM: 3.4 cm⁻¹).

We also studied how this vapor-phase photochemical reaction of
disilane depends on UV intensity. At high total and disilane partial
pressures and low UV intensity, light at the wavelengths effective for
the photochemical reaction of disilane apparently no longer reached
the crystal surface, because it was absorbed by disilane near the
window through which the light entered. In this case, the silylene
species were mostly generated thermally. When the total pressure and
the disilane partial pressure were low, the photochemical reactions
had a greater effect on the growth rate.
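
The shielding effect described above can be pictured with the Beer-Lambert law, in which the transmitted fraction of light falls exponentially with the absorber density along the optical path. The sketch below is a minimal illustration under stated assumptions: the absorption cross section and the window-to-wafer path length are hypothetical placeholders (the text gives neither), and the partial pressures are illustrative, so only the trend is meaningful.

```python
import math

K_BOLTZMANN = 1.380649e-23  # J/K (exact SI value)


def transmitted_fraction(partial_pressure_pa: float,
                         path_length_m: float,
                         cross_section_m2: float,
                         gas_temperature_k: float = 300.0) -> float:
    """Beer-Lambert transmission exp(-sigma * n * L) through an absorbing gas.

    The number density n comes from the ideal gas law, n = p / (k_B * T).
    """
    if min(partial_pressure_pa, path_length_m, cross_section_m2) < 0:
        raise ValueError("inputs must be non-negative")
    number_density = partial_pressure_pa / (K_BOLTZMANN * gas_temperature_k)
    return math.exp(-cross_section_m2 * number_density * path_length_m)


if __name__ == "__main__":
    SIGMA = 1e-22  # m^2, HYPOTHETICAL cross section, not taken from the paper
    PATH = 0.05    # m, HYPOTHETICAL window-to-wafer distance
    for p in (1.0, 10.0, 100.0, 1000.0):  # illustrative partial pressures in Pa
        frac = transmitted_fraction(p, PATH, SIGMA)
        print(f"{p:7.1f} Pa -> {frac:.3f} of the light reaches the wafer")
```

With any fixed cross section and path length, raising the absorber partial pressure lowers the fraction of photolytically active light that reaches the wafer, which is the qualitative behavior the text attributes to the high-pressure case.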

Reduced-pressure photoepitaxy made epitaxial growth possible at
temperatures as low as 540 °C. The crystal quality of the layer thus
grown was observed by Raman scattering spectroscopy. The full width at
half maximum (FWHM) of the Raman peak for bulk Si was about 3.2 cm⁻¹,
while that for polycrystalline Si deposited by conventional LPCVD is
about 8 cm⁻¹. The Raman spectrum for a photoepitaxial layer grown at
540 °C exhibited nearly the same waveform and FWHM value as bulk Si
(see Fig. 12). This shows that the crystal quality of a photoepitaxial
layer grown at 540 °C is high. Using high-resolution TEM (HR-TEM), it
was possible to study crystal defects and the interface structure
between the photoepitaxial layer and the substrate (see Fig. 13). No
defects could be seen in the single-crystal photoepitaxial layer grown
at 540 °C, and there was no discontinuity at the interface.

UV irradiation enhanced the vapor-phase reaction more effectively at
reduced pressure than at atmospheric pressure, and the growth rate
below 700 °C therefore increased dramatically. Furthermore, the lowest
epitaxial growth temperature was reduced from 630 °C to 540 °C.

Fig. 13: High-resolution TEM photograph of a photoepitaxial layer
grown at 540 °C. The photoepitaxial layer was grown on a LOCOS
structure, and poly-Si grew on the field oxide.


4. Impurity doping 

Impurity doping during epitaxial growth is
an important technique for device fabrication.
We investigated heavy boron doping during
reduced-pressure photoepitaxy, using
diborane (B₂H₆) as the in-situ doping source.
The UV absorption threshold of diborane is
about 190 nm, so it absorbs only a small
percentage of the UV radiation used in this
experiment. At this threshold wavelength, the
absorption cross section of diborane is two
orders of magnitude lower than that of disilane;
diborane therefore cannot be directly
dissociated photochemically.

For low diborane flow rates corresponding 
to doping carrier concentrations below 
1 x 10! cm™, carrier concentration increased 
linearly regardless of UV irradiation (see Fig. 14). The presence of
UV irradiation slightly reduced the doped carrier concentration. At
600 °C, the concentration of silylene species in the vapor phase was
smaller without UV irradiation, because with UV irradiation disilane
also dissociated photochemically. The ratio of boron species
contributing to doping to silicon species contributing to film
deposition was therefore larger without UV irradiation. Consequently,
for low diborane flow rates, the doped carrier concentration was
slightly smaller with UV irradiation. For high diborane flow rates
with UV irradiation, the doped carrier concentration was almost
constant at 1.5 × 10²⁰ cm⁻³ (see Fig. 15). This exceeds the solid
solubility limit for boron in Si at the deposition temperature of
600 °C. However, at these concentrations, thermal epitaxial growth
(without UV irradiation) was not possible and the boron atom
activation deteriorated.

Fig. 14: Boron doping characteristics in the low concentration range
with or without UV irradiation.

Fig. 15: Doping characteristics in the high diborane flow rate region.
With UV irradiation, the deposited film was a single crystal. Without
UV irradiation, poly-Si or boron-silicon compounds were deposited.

Fig. 16: Activation ratio of doped boron atoms vs. carrier
concentration. The activation ratio is defined as the ratio of the
carrier concentration to the concentration of doped boron atoms.

Fig. 17: FWHM of Raman scattering spectroscopy vs. doped carrier
concentration.

Fig. 18: Hole mobility vs. carrier concentration. Experimental data
are plotted. The Irvin curve is drawn as a plain line.

The boron atom activation ratio is related to the carrier
concentration. The carrier concentration and the boron atom
concentration in the deposited films were determined using Hall
measurement and SIMS analysis, respectively (see Fig. 16). Above a
carrier concentration of 1 × 10¹⁸ cm⁻³, the activation ratio without
UV irradiation deteriorated, finally dropping to only a few percent.
With UV irradiation, however, doped boron atoms were fully activated
up to a concentration of 1.5 × 10²⁰ cm⁻³; the ratio then dropped
sharply to a few percent when the doped boron atom concentration
exceeded this value.
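
The activation ratio used here follows directly from the definition in the Fig. 16 caption: the carrier concentration divided by the doped boron atom concentration. The sketch below is a minimal illustration of that definition. The function name is an assumption, and the sample values are hypothetical numbers chosen to mimic the "fully activated" and "a few percent" cases, not measured data.

```python
def activation_ratio(carrier_conc_cm3: float, boron_conc_cm3: float) -> float:
    """Fraction of incorporated boron atoms that are electrically active.

    carrier_conc_cm3: hole concentration (e.g. from Hall measurement), cm^-3
    boron_conc_cm3:   total boron concentration (e.g. from SIMS), cm^-3
    """
    if boron_conc_cm3 <= 0:
        raise ValueError("boron concentration must be positive")
    if carrier_conc_cm3 < 0:
        raise ValueError("carrier concentration must be non-negative")
    return carrier_conc_cm3 / boron_conc_cm3


if __name__ == "__main__":
    # HYPOTHETICAL sample values for illustration only.
    samples = {
        "fully activated": (1.0e20, 1.0e20),
        "poorly activated": (3.0e18, 1.0e20),
    }
    for label, (carriers, boron) in samples.items():
        print(f"{label}: {activation_ratio(carriers, boron):.0%}")
```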

This phenomenon was strongly related to crystal quality. Using Raman
scattering, we observed the crystal quality of the heavily boron-doped
photoepitaxial layer by examining the dependence of the FWHM on the
doped carrier concentration at 600 °C (see Fig. 17). Without UV
irradiation, the FWHM increased with carrier concentration, showing
that the crystal quality deteriorated. As the doped carrier
concentration increased up to 1 × 10¹⁹ cm⁻³, poly-Si was produced, and
finally boron-silicon compounds were deposited. With UV irradiation,
however, the FWHM remained almost constant at 3.5 cm⁻¹, as single
crystals grew up to a carrier concentration of 1.5 × 10²⁰ cm⁻³. This
dependence of crystal quality on the doped carrier concentration
agrees well with the boron activation ratio shown in Fig. 16. The Hall
mobilities of photoepitaxial layers grown at 600 °C were determined by
Hall measurement as a function of the doped carrier concentration (see
Fig. 18). The results fit the Irvin curve well throughout the doping
range, which shows that a photoepitaxial layer grown at 600 °C with
heavy boron doping has good crystallinity.

The dependence of carrier doping efficiency
on diborane flow rate is shown in Fig. 19.
The efficiency is defined as the ratio of the total
number of carriers in the deposited film to the
total number of boron atoms introduced into the
vapor phase as diborane. Without UV irradiation,
this efficiency saturated at a very low level,
lower than with UV irradiation. With UV irradiation,
the efficiency increased sharply up to a flow
rate of 2 x 107? cm?/min, which corresponds to 
a carrier concentration of 1.5 × 10²⁰ cm⁻³. Above
this flow rate, the carrier doping efficiency
decreased, eventually dropping to the same level
as without UV irradiation. These results show
that UV irradiation enhances carrier doping
efficiency and suggest that an enhanced diborane
vapor-phase reaction occurs, increasing the
probability of boron species adsorption.
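
The doping efficiency defined above can be sketched as a simple ratio: carriers in the film divided by boron atoms supplied as diborane. In the example below, the helper names, the assumption that the flow rate is given in standard cm³/min of undiluted diborane (0 °C, 1 atm, ideal gas), and the assumption of a uniformly doped film are all illustrative. The text does not specify these details, and the numbers in the usage section are hypothetical.

```python
AVOGADRO = 6.02214076e23        # 1/mol (exact SI value)
MOLAR_VOLUME_STP_CM3 = 22414.0  # cm^3/mol, ideal gas at 0 degC and 1 atm


def boron_atoms_supplied(diborane_flow_sccm: float, minutes: float) -> float:
    """Boron atoms delivered by a B2H6 flow given in standard cm^3/min."""
    moles = diborane_flow_sccm * minutes / MOLAR_VOLUME_STP_CM3
    return 2.0 * moles * AVOGADRO  # two boron atoms per B2H6 molecule


def carriers_in_film(carrier_conc_cm3: float, area_cm2: float,
                     thickness_nm: float) -> float:
    """Total carriers in a uniformly doped film (thickness converted nm -> cm)."""
    return carrier_conc_cm3 * area_cm2 * thickness_nm * 1e-7


def doping_efficiency(carrier_conc_cm3: float, area_cm2: float,
                      thickness_nm: float, diborane_flow_sccm: float,
                      minutes: float) -> float:
    """Carriers in the film divided by boron atoms introduced as diborane."""
    supplied = boron_atoms_supplied(diborane_flow_sccm, minutes)
    if supplied <= 0:
        raise ValueError("no boron supplied")
    return carriers_in_film(carrier_conc_cm3, area_cm2, thickness_nm) / supplied


if __name__ == "__main__":
    # HYPOTHETICAL inputs for illustration; none are taken from the paper.
    eff = doping_efficiency(carrier_conc_cm3=1.0e20, area_cm2=80.0,
                            thickness_nm=100.0, diborane_flow_sccm=0.01,
                            minutes=10.0)
    print(f"doping efficiency: {eff:.2e}")
```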

Diborane photolysis in the vapor phase was studied using FT-IR. The IR
absorption due to the B-H3 bending mode was found to be a function of
the disilane flow rate (see Fig. 20). As the disilane flow rate was
increased, the IR absorption decreased, suggesting that diborane
photodissociation increased in the vapor phase, as shown in Fig. 21.
When no disilane was added, the diborane partial pressures with and
without UV irradiation, as determined from the IR absorption spectra,
were identical, and thus the diborane did not dissociate
photochemically. When disilane was present in the vapor phase,
however, the diborane partial pressure with UV irradiation decreased
markedly, showing that diborane dissociated photochemically. Without
UV irradiation, very little diborane dissociated, even in the presence
of disilane. We believe that diborane may be dissociated
photochemically by disilane photosensitization. Photochemically
generated silylene species eventually collide with diborane molecules,
dissociating them through energy relaxation. Thus, photochemical
diborane dissociation in the vapor phase increases the boron species
concentration and increases the doping efficiency at heavy
concentrations.

Fig. 19: Carrier doping efficiency vs. diborane flow rate.

Fig. 20: IR absorption spectra due to the B-H3 bending mode as a
function of the flow rate of added disilane.

Fig. 21: Diborane partial pressure in the vapor phase vs. flow rate of
added disilane.

The low growth temperature enabled us to
grow a heavily doped photoepitaxial layer with
an abrupt boron profile. This profile could not
be produced by conventional epitaxy or ion
implantation. The boron profile for a heavily
doped photoepitaxial layer with a carrier con-
centration of 8 x 10¹⁹ cm⁻³ grown at 600 °C
was measured using SIMS (see Fig. 22). The
transition width between the p⁺ epitaxial layer
and p substrate was about 35 nm.

Fig. 22—Boron profile of photoepitaxial layer with
heavy boron doping grown at 600 °C. The
epitaxial layer was 100 nm thick and the
transition width was 35 nm.


5. Photoenhanced low-temperature selective epitaxy

Si selective epitaxy has good potential for
achieving advanced device structures. Conven-
tional selective epitaxy is performed using high-
temperature CVD, which uses SiH₂Cl₂, or HCl
added to a SiH₄/H₂ gas system, as the source
gas?7)"39). With either gas, the growth
temperature must be above 900 °C for selective
growth with good crystallinity. However, this
high temperature makes it difficult to achieve
abrupt impurity profiles because of impurity diffu-
sion, and it also generates large facets and
stacking faults at the Si-SiO₂ sidewall interface.
Thus, to obtain a high selective growth rate below
700 °C while still retaining good crystal quality,
we used photoenhanced CVD.

Low-temperature selective epitaxy needs a 
very low pressure, usually below 10 Pa. There- 
fore, selectivity and crystal growth can be 
greatly degraded by moisture and oxygen
contamination?!)’??). For low-temperature selec-
tive epitaxy, an ultraclean deposition environ-
ment is necessary; we used a specially designed
CVD system to suppress moisture and oxygen 
contamination. This system had a cold-walled 
quartz CVD chamber that was evacuated to a 
base pressure of 10° Pa at high temperatures 
using an oil-free pumping system, a turbo-
molecular pump, a loadlock chamber, and a
transport chamber (see Fig. 23). The wafer was 
transported into the UHV without being ex- 
posed to air using magnetic-coupled linear 
transport devices. Using this UHV/CVD cham- 
ber, we tested the pressure dependence of 
selective growth over a wide range of pressures, 
from 10⁻⁴ Pa to 10⁴ Pa. To reduce contamina-
tion due to carbon and outgassing from the
SiC-coated carbon susceptor, we directly heated
the Si substrate from the back using a newly
developed Xe long-arc lamp without the SiC-
coated carbon susceptor. The Si substrate
absorbed the light from the Xe lamps very
efficiently, causing the substrate temperature to
rise rapidly. The wafer surface was vertically
irradiated by UV light from high-pressure mer-
cury lamps with a UV intensity of 1.2 W/cm² at
the surface of the wafer. The substrate tempera-
ture was measured using a pyrometer calibrated
against a thermocouple buried in the Si
wafer. UV irradiation raised the temperature
by 50 °C; to compensate for this increase,
we decreased the Xe lamp output during UV
irradiation. To remove the native oxide, the
substrate was heated in a hydrogen ambient at
800 °C and 67 Pa for 10 min without UV
irradiation. Si films were deposited using a
Si₂H₆/H₂ gas system. The disilane partial
pressure was varied from 1x10% Pa to
1.7x 10°? Pa and hydrogen pressure was
varied from 1.3 x 10°! Pa to 2.6 x 10% Pa. The
total pressure during film deposition was almost
equal to the hydrogen pressure.

Fig. 23 labels: UV lamp unit, gas inlet, transfer chamber, quartz chamber, turbomolecular pump.

Fig. 24—The surface morphologies of deposited Si films at a substrate temperature of 750 °C: a) thermal growth, b) photoepitaxy. The top photographs
were taken through a Nomarski microscope, and the bottom photographs were taken through an SEM.

At a substrate temperature of 750°C, a 
hydrogen pressure of 67 Pa, and a disilane partial
pressure of 3.6 x 10°? Pa, the thermal growth 
(without UV irradiation) was nonselective {see
Fig. 24a)}. In contrast, photoepitaxy clearly
produced selective growth at 750 °C (substrate 
temperature: 700 °C + UV irradiation increase: 
50 °C) {see Fig. 24b)}. The deposited films on
Si showed mirror-like surfaces for both thermal 
and photoepitaxy growth as shown in Fig. 24. 
However, on SiO₂, the thermally grown poly-
crystalline film had a rough surface due to large
grains resulting from the high deposition tem- 
perature. 

We investigated the dependence of selectivi- 
ty on deposition temperature both with and 
without UV irradiation. The hydrogen pressure 
was set at 67 Pa, the disilane partial pressure was 
set at 3.6 x 10°° Pa, and the deposition tempera- 
ture was varied from 610 °C to 890 °C. With UV 
irradiation, there was still selectivity until the 
substrate temperature dropped to 700 °C, but 
without UV irradiation, selective epitaxial 
growth required a temperature of 850 °C (see 
Fig. 25). The photoepitaxial growth rates on Si 
and SiO₂ depended very little on the substrate
temperature throughout the range from 610 °C 
to 890 °C. 

We also studied the dependence of Si film 
thickness on Si and SiO₂ on the deposition time.
At a substrate surface temperature of 750 °C, a
hydrogen pressure of 67 Pa, and a disilane partial
Fig. 25—Deposition temperature dependence of deposi-
tion rate on Si and SiO₂ with or without UV
irradiation.
Fig. 26—Deposition time dependence of deposited Si
film thickness at 750 °C.


pressure of 3.6 x 10°? Pa, Si film could not be 
deposited on SiO₂ photoepitaxially, but could
be deposited by thermal growth, and there was
an incubation period for Si nucleation on SiO₂
(see Fig. 26). The growth rates on Si by
photoepitaxy and by thermal growth were 
almost equal, because in this temperature range 
of 610 °C to 890 °C, disilane mainly dissociates 
thermally. However, at a substrate surface tem- 
perature of 600°C, Si film was deposited on 
Fig. 27—Deposition time dependence of deposited Si
film thickness at 600 °C.


Si and SiO₂ by both photoepitaxy and thermal
growth (see Fig. 27). The thermal growth rate was
much lower than the photoepitaxy rate for
both Si and SiO₂ because at this temperature,
disilane mainly dissociates photochemically. For
thermal growth, there was an incubation period
for Si nucleation on SiO₂, but this incubation
period disappeared with UV irradiation.

From these results, we conclude that UV 
irradiation below 600°C enhances disilane 
decomposition in the vapor phase, increasing the 
concentrations of the reactive species. For UV 
irradiation at 600 °C, the growth rate increases 
and the incubation period of Si nucleation on
SiO₂ decreases. Above 600 °C, disilane mainly
decomposes thermally. UV irradiation enhances
the desorption of adsorbed silylene species on
SiO₂, lowering the selective growth temperature.

Selectivity also depends on the growth rate, 
which in turn depends on the disilane partial 
pressure (see Fig. 28). At a substrate surface 
temperature of 750 °C and a hydrogen pressure 
of 67 Pa with UV irradiation, selective growth 
was achieved at disilane partial pressures below
1 x 10°? Pa. This corresponds to a growth rate 
of 15 nm/min. Under these conditions, however,
thermal growth could not produce selective
growth. The epitaxial layer grows on Si by
photoepitaxy and by thermal growth at almost 
rate on Si and SiO₂ with or without UV
Fig. 29—Relationship of hydrogen pressure and selec-
tivity. Substrate temperature was 670 °C.


the same rate because disilane mainly decom- 
poses thermally. 

Selectivity depends on yet another factor, 
hydrogen pressure, which is almost equal to the 
total deposition pressure (see Fig. 29). The 
substrate surface temperature was 670 °C and 
the disilane partial pressure was 3.6 x 107° Pa. 
At hydrogen pressures above 10 Pa, selective
growth did not occur, and the growth rate of the
Si film on Si and SiO₂ increased as the hydro-
gen pressure increased. At hydrogen pres-
sures below 10 Pa, selective growth was observed, and
the growth rate of the epitaxial layer on Si did
not depend on hydrogen pressure.

The selective epitaxial layer grown thermally
at 850 °C showed large facets and a high
density of microfacets at the Si-SiO₂ sidewall
interface. The selective epitaxial layer grown
photoepitaxially at 720 °C showed small facets
and microfacet formation at the Si-SiO₂ inter-
face.


6. Application for bipolar devices 

It was possible to create a heavy boron 
doped epitaxial layer with an abrupt impurity 
profile using photoepitaxy. We used this layer 
for the base region of a high-speed bipolar 
transistor? *5), Scaling of bipolar transistors 
requires a very thin base with a high doping 
concentration to achieve high-speed operation 
while suppressing base punchthrough. Epitaxial- 
ly grown bases are most suitable for this, and we 
describe here the construction of epitaxially 
grown base transistors (EBT) and the character- 
istics of EBTs and ICs. 

The EBT was made using a non-self-aligned,
double poly-Si process. A 0.5-µm rule lithography
was used to reduce parasitic capacitances, and
transistors were built on conventional n-type
epitaxial layers with a resistivity of 0.09 Ω·cm
and a thickness of 1.0 µm. They were isolated by a
poly-Si-filled U-shaped groove, which reduced the
collector-substrate capacitance. To form the
base region, photoepitaxial layers with boron
concentrations of 3 x 10¹⁸ cm⁻³ were grown at
600 °C using photoenhanced low-temperature
epitaxy. The emitter region was covered with an
oxide/nitride/oxide pad and 250 nm of poly-Si was
deposited as the extrinsic base electrode. Boron
ions were implanted into the poly-Si base
electrode until its sheet resistance reached
100 Ω/□. From the mask size of 0.5 µm, the
completed emitter was narrowed to only 0.25 µm
using the sidewall oxide and reactive ion
etching (see Fig. 30). To form a very shallow
emitter junction, we used rapid thermal anneal-
ing (RTA) to diffuse arsenic from the emitter
poly-Si into the epitaxial base layer. By using the
epitaxial base technique and RTA emitter
driving, we were able to specify the intrinsic
base impurity profile with great flexibility and
accuracy because the base width and base peak
concentration can be controlled independently.
To produce both a low emitter resistance and a
shallow emitter junction depth, we used RTA
at 1100 °C for 2 s. The measured
impurity profiles of the EBT agreed well with
results calculated using a modified SUPREM 3
simulation (see Fig. 31). These results clearly
show that a shallow emitter junction just 35 nm
deep was achieved. The base width could be
reduced simply by decreasing the deposited epitaxial
base layer thickness; we succeeded in reducing
the base width to 32 nm for a 45 nm epitaxial
base layer thickness.

Fig. 31—Intrinsic transistor profile (SIMS and simula-
tion) for 80 nm photoepitaxially grown base
thickness.

All the transistors we fabricated had base
widths of 31, 46, or 66 nm and all maintained
BVCEO above 3.0 V, despite the thin base and
high collector concentration of 1 x 10¹⁷ cm⁻³.
The base region was not susceptible to base
punchthrough since the epitaxial base layer was
doped to 3 x 10¹⁸ cm⁻³. However, BVEBO in-
creased as the epitaxial base layer thickness
decreased because the base peak concentration
was reduced by emitter driving. At the
limit, therefore, an EBT with a 45 nm epitaxial
layer thickness and 32 nm base width showed
slight base punchthrough. The base current in a
Gummel plot shows ideal characteristics over
a current range of more than four decades
(see Fig. 32). This indicates that the photoepitaxial-
ly grown base layer has good crystallinity with
no defects to generate recombination centers.
The current gain was 100 and 150 for base
widths of 66 nm and 46 nm, respectively, and
increased to 250 as the base width decreased to
31 nm. Current gain was independent of collec-
tor current over a wide range (see Fig. 33).

Fig. 32—Gummel plot of 80 nm photoepitaxially grown
base thickness EBT.

Fig. 33—Current gain vs. collector current characteristics.

Fig. 34—Cutoff frequency as a function of collector
current.

The frequency response of the EBT is pri- 
marily limited by the base transit time. The 
cutoff frequency improved from 32 GHz to 
43 GHz as the base width was reduced from 
66 nm to 32 nm (see Fig. 34). At higher fre-
quencies, however, the base transit time no
longer dominates. A 32 nm base is thin enough 
to make the base transit time relatively small 
compared with other components. To improve 
the frequency response further, the parasitic
capacitances and resistances must be reduced 
and the collector impurity profile should be 
optimized. 
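
To see why a 32 nm base makes the transit time a minor term, a rough estimate helps. The sketch below uses the textbook diffusion-limited expression for a uniformly doped base, tau_B = W_B² / (2 D_n), and the relation f_T = 1 / (2π tau_ec) between the cutoff frequency and the total emitter-to-collector delay. Both relations are standard approximations rather than the authors' analysis, and the electron diffusivity D_n is an assumed, illustrative value for a heavily doped p-type base.

```python
import math

# Assumed minority-electron diffusivity in the heavily doped base (cm^2/s).
# Illustrative value only; it is not given in the paper.
ASSUMED_DN_CM2_S = 6.5


def base_transit_time_s(base_width_nm, dn_cm2_s=ASSUMED_DN_CM2_S):
    """Diffusion-limited base transit time W^2 / (2 Dn) for a uniform base."""
    width_cm = base_width_nm * 1e-7
    return width_cm ** 2 / (2.0 * dn_cm2_s)


def total_delay_s(cutoff_ghz):
    """Total emitter-to-collector delay implied by a cutoff frequency."""
    return 1.0 / (2.0 * math.pi * cutoff_ghz * 1e9)


if __name__ == "__main__":
    # Base widths and cutoff frequencies quoted in the text.
    for width_nm, ft_ghz in ((66.0, 32.0), (32.0, 43.0)):
        tau_b = base_transit_time_s(width_nm)
        tau_ec = total_delay_s(ft_ghz)
        print(f"W = {width_nm:.0f} nm: tau_B = {tau_b * 1e12:.2f} ps, "
              f"tau_ec = {tau_ec * 1e12:.2f} ps, "
              f"share = {tau_b / tau_ec:.0%}")
```

Under the assumed diffusivity, the base transit share of the total delay drops markedly when the base is thinned from 66 nm to 32 nm, which is consistent with the statement that the remaining delay components then limit the response.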

Conventional ECL ring oscillators were built
using the EBT. The minimum gate delay times
for the 66 nm base width EBT were 67 ps for a
low-power gate (1.4 mW/gate) and 58 ps for a
high-power gate (7.5 mW/gate). However, the
gate delay time deteriorated as the base width
decreased (see Fig. 35). The cutoff frequency
and the gate delay time thus showed opposite
dependences on the base width, because of
the base resistance. A 66 nm base EBT had
an intrinsic base resistance (Rbi) of 26 kΩ and
an AC base resistance (rb) of 180 Ω. Both base
resistances increased as the base width decreased.
The Rbi of the 32 nm base EBT became so large
that measurement using pinch resistance pat-
terns was impossible (see Table 1), and such a
high base resistance limited the performance of
the ECL. These high resistances were caused by
a thin link base (under the oxide/nitride/oxide
pad) and the relatively low boron doping concentra-
tion of 3 x 10¹⁸ cm⁻³. This resistance could be
lowered further only by increasing the base boron
doping concentration up to 6 x 10¹⁸ cm⁻³.
Simulation predicts that gate delay times below
30 ps are possible for a base boron doping con-
centration of 6 x 10¹⁸ cm⁻³. We successfully
integrated the EBT into a small-scale integrated
circuit, and made a 1/8-divider using T-F/F
circuits. We operated the circuit up to 11.7 GHz
input frequency with a total power dissipation
of 440 mW (see Fig. 36).

Fig. 35—Power-delay characteristics of ECL gates.

Table 1. Intrinsic base sheet resistance and AC base
resistance of EBT
Rows: photoepitaxial layer thickness; as-grown sheet
resistance (kΩ/□); intrinsic base resistance (kΩ/□);
AC base resistance (Ω).
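
The gate and divider figures quoted above can be combined into a few derived quantities. The sketch below is only a worked illustration: it computes the power-delay product of the two ECL operating points reported for the 66 nm base EBT and the output frequency of an ideal 1/8 divider at the reported maximum input frequency. The ring-oscillator relation f = 1 / (2 N t_pd) is the usual textbook expression; the number of stages N is not given in the text, so any value passed to it is hypothetical.

```python
def power_delay_product_fj(delay_ps, power_mw):
    """Power-delay product in femtojoules (1 ps x 1 mW = 1 fJ)."""
    return delay_ps * power_mw


def divided_frequency_ghz(input_ghz, division_ratio):
    """Output frequency of an ideal frequency divider."""
    return input_ghz / division_ratio


def ring_oscillator_frequency_hz(gate_delay_ps, stages):
    """Oscillation frequency of an N-stage ring oscillator, f = 1 / (2 N t_pd)."""
    return 1.0 / (2.0 * stages * gate_delay_ps * 1e-12)


if __name__ == "__main__":
    # Operating points quoted in the text for the 66 nm base width EBT.
    for label, delay_ps, power_mw in (("low power", 67.0, 1.4),
                                      ("high power", 58.0, 7.5)):
        pdp = power_delay_product_fj(delay_ps, power_mw)
        print(f"{label}: {pdp:.1f} fJ per gate transition")

    # Ideal 1/8 division of the reported maximum input frequency.
    print(f"divider output: {divided_frequency_ghz(11.7, 8):.4f} GHz")
```

The comparison makes the trade-off explicit: the high-power gate gains a few picoseconds of delay at several times the energy per transition.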


Fig. 36—Input and output waveform of 1/8-divider
fabricated using EBT.


7. Conclusion

We have described low-temperature epitaxial
growth techniques using photoenhanced proc-
esses. At atmospheric pressure, UV irradiation
decreased the epitaxial growth temperature to
630 °C and reduced the activation energy.
However, the growth rate below 700 °C was not
fast enough to grow thick epitaxial films. To
improve the growth rate at low temperature and
to lower the epitaxial growth temperature, we
tried photoepitaxy at reduced pressure. The
photochemical effect on the disilane vapor-
phase reaction was greatly improved by reducing
the deposition pressure. Consequently, a reduction
in epitaxial growth temperature of about 90 °C
was achieved, pushing the temperature down to
540 °C, and the growth rate below 700 °C was
much higher for photoepitaxy at reduced
pressure.

Photoepitaxy also enhanced boron doping at
low growth temperatures. With UV irradiation at
600 °C, heavy boron doping of 1.5 x 10²⁰ cm⁻³
produced superior crystal quality and very
abrupt boron profiles. To control the
photoepitaxial surface reactions precisely, we constructed
a special photoepitaxial chamber evacuated to
a UHV base pressure to suppress moisture and
oxygen contamination. Using this chamber, we
were able to achieve selective photoepitaxy at
low temperatures with small facets and few
defects at the Si-SiO₂ interface. We applied a
heavily boron-doped photoepitaxial layer to the
base region of a high-speed bipolar transistor,
and produced a cutoff frequency of 43 GHz
using a very thin base width of 31 nm. A fre-
quency divider with an operating frequency of
11.8 GHz was made using a photoepitaxially
grown base transistor. We have thus demon-
strated the suitability of photoenhanced low-
temperature epitaxy for LSI wafer processes of
the future.
A photoexcited dry cleaning process using highly purified chlorine gas was developed as an
ultraclean process for silicon ultra-large scale integrated-circuit (ULSI) production. The 
process aids in deep-submicron device fabrication. Metallic contaminants remaining on a 
wafer’s surface after wet cleaning can be significantly reduced by photoexcited cleaning. 
Photoexcited cleaning can also remove contaminants that have penetrated the silicon sub- 
strate during reactive-ion processing by etching the contaminated layer without damaging 
the uncontaminated layer. Thus, dry cleaning improves thin gate oxide integrity, SiO₂-Si
interface quality, carrier lifetime, junction leakage characteristics, and epitaxial film quality. 


1. Introduction 

Contaminants such as alkali metals, heavy 
metals, and hydrocarbons introduced onto 
silicon surfaces during device manufacture can 
degrade both device performance and reli-
ability1),2). Although the use of highly pure wet
chemicals, resists, and gases can alleviate the
problem, processes causing surface contamina-
tion and damage such as ion implantation, dry
etching, and resist ashing are frequently used 
when manufacturing ULSI devices. Thus, clean- 
ing the wafer surface has become an important 
part of ULSI processing. Wafers are cleaned 
prior to thermal oxidation, impurity diffusion, 
epitaxial growth of silicon films, chemical vapor 
deposition, and other thermal processes. 

Currently, wet cleaning processes are prima- 
rily used; these are useful for removing large 
particles and other contaminants remaining on 
wafer surfaces after such processes as dry etch- 
ing and ion implantation. 

ULSI processing requires very clean wafer 
surfaces. For example, for a dynamic random 
access memory (DRAM), surface cleanliness 
determines the refresh time, which should 
increase as the degree of integration increases.

Photoexcited dry cleaning using reactive
gases improves upon wet cleaning and other dry
cleaning techniques such as reactive-plasma or 
reactive-ion cleaning, both of which can damage 
and contaminate a wafer’s surface. Some of the 
advantages of photoexcited dry cleaning over 
conventional wet cleaning are as follows: 

1) As devices become smaller, it becomes 
harder for wet chemicals to clean surface 
anomalies such as contact holes and deep 
trenches. Photoexcited radicals can clean 
these anomalies. 

2) Gases can be cleared from a chamber faster
and more easily than wet chemicals, so less recon-
tamination occurs on clean surfaces.

3) The process chambers are similar to those 
used in CVD or dry etching, making the 
equipment compatible with already-establish- 
ed automated ULSI production lines. 

4) The exposure atmosphere is controlled, and 
film deposition can be conducted without 
exposing the wafer to air by connecting the 
process chambers to each other. Thus, high- 
quality gate oxide and epitaxial films 
become easier to fabricate.

5) The amount of chemicals required is usually 
much smaller for gas cleaning, making the 
system and process more economical. 

6) Dry cleaning, of course, does not require a 
drying process, which often causes such 
problems as watermarks. 

Photoexcited dry cleaning using ozone is
useful for removing organic particles. For 
inorganic materials such as metals, photoexcited 
halogen radicals are effective. Combining photo- 
excited and wet cleaning significantly improves 
cleanliness?’ . 

This paper discusses the procedure and elec- 
trical properties of photoexcited dry cleaning 
using halogen, and discusses the advantages of 
this method over wet chemical cleaning for 
ULSI processing. 


2. Cleaning mechanisms 

The reaction mechanisms in photoexcited 
dry cleaning using halogen radicals are shown in 
Fig. 1. The halogen gas is dissociated to atomic 
radicals by ultraviolet irradiation. Chlorine gas 
was initially used as the reactive halogen gas 
because it is dissociated by the ultraviolet light 
of both high and low pressure mercury lamps. 

Figure 2 shows the absorption spectrum of 
chlorine®? and the emission spectrum of a 
microwave-excited high-pressure mercury lamp. 
Any radiation having a wavelength between 
250 nm and 400 nm contributes to the dissoci- 
ation of chlorine molecules. The chlorine 
molecules dissociate into highly reactive atomic 
radicals which bond with the silicon and metal 
contaminants on the wafer surface. 
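
The energetics behind this statement can be checked with a short calculation. The sketch below converts wavelength to photon energy with E = hc/λ and compares it with the Cl-Cl bond dissociation energy. The bond energy of about 2.5 eV (roughly 243 kJ/mol) is a handbook value assumed here, not a figure from the paper, and the function names are illustrative.

```python
PLANCK_J_S = 6.62607015e-34        # Planck constant (J s)
LIGHT_SPEED_M_S = 2.99792458e8     # speed of light in vacuum (m/s)
JOULES_PER_EV = 1.602176634e-19    # elementary charge (J per eV)

# Assumed Cl-Cl bond dissociation energy (handbook value, about 243 kJ/mol).
CL2_BOND_ENERGY_EV = 2.5


def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a vacuum wavelength given in nm."""
    return PLANCK_J_S * LIGHT_SPEED_M_S / (wavelength_nm * 1e-9) / JOULES_PER_EV


def threshold_wavelength_nm(bond_energy_ev):
    """Longest wavelength whose photon energy still equals the bond energy."""
    return PLANCK_J_S * LIGHT_SPEED_M_S / (bond_energy_ev * JOULES_PER_EV) * 1e9


if __name__ == "__main__":
    for wavelength_nm in (250.0, 400.0):
        energy = photon_energy_ev(wavelength_nm)
        print(f"{wavelength_nm:.0f} nm -> {energy:.2f} eV "
              f"(exceeds bond energy: {energy > CL2_BOND_ENERGY_EV})")
    print(f"energetic threshold: {threshold_wavelength_nm(CL2_BOND_ENERGY_EV):.0f} nm")
```

Photons across the whole 250 nm to 400 nm band carry more than the assumed bond energy. The purely energetic threshold lies at a longer wavelength, so the 400 nm limit quoted in the text reflects where chlorine absorbs (Fig. 2) rather than the photon energy alone.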

The metal contaminants are removed from
the surface as chlorides. There are two desorp-
tion mechanisms in photoexcited dry cleaning
(see Fig. 1). The first mechanism is the evapora-
tion of chloride compounds (MClx). In this
mechanism, chlorine radicals react with con-
taminant metal atoms to form volatile
chloride compounds. The second is the lift-off
mechanism, in which electron-hole pairs are
generated® during irradiation by ultraviolet
light. Chlorine radicals adsorbed on silicon
surfaces receive electrons and change to nega-
tively charged chlorine ions. The positively
charged silicon substrate attracts the chlorine
ions, producing silicon chloride compounds
whose high vapor pressure causes spontane-
ous etching. Metal contaminants are lifted off at
the same time as the silicon is etched. While the
vapor pressure of metallic chlorides such as those of Ti,
Al, and Fe is relatively high, the vapor pressure
of the chlorides of K, Na, Mg, and Ni is low”). The former can be
removed by direct vaporization, while the latter
must be removed by lift-off.

Fig. 1—The two desorption mechanisms in photoexcited
dry cleaning: direct vaporization of chloride
compounds, and lift-off.

Fig. 2—Absorption spectrum of chlorine>) and emission
spectrum of a microwave-excited high-pressure
mercury lamp. Light between 250 nm and
400 nm contributes to the dissociation of chlorine
molecules. (Axes: intensity (arb. units) vs. wavelength,
250 nm to 500 nm; curves: Cl₂ absorption spectrum and
emission spectrum.)


3. Cleaning procedure 


The wafer is placed in a quartz chamber and
heated from the back by an infrared lamp in a
high-purity chlorine atmosphere. A microwave-
excited mercury lamp irradiates the wafer from
directly above. Figure 3 shows the dependence
of the etch depth on the ultraviolet light irradia-
tion time at a substrate temperature of 150 °C?).
The ultraviolet intensity was 22 mW/cm² over
the wavelength range from 200 nm to 350 nm.
The figure shows that etching occurs only when
the wafer is irradiated®).

Fig. 3—Dependence of etching depth on ultraviolet light
irradiation time. Etching depth is proportional to
irradiation time”). There is no delay at the
beginning of etching. (Axes: etching depth (nm) vs.
UV irradiation time (min).)

Fig. 4—Depth profiles of wafer after photoexcited dry
cleaning and deposition of 300 nm polysilicon
layer.

Fig. 5—Surface morphology after various types of wet cleaning followed by photoexcited dry cleaning as observed by
transmission electron microscope (TEM)?). (Recovered panel labels: HNO₃ hot; NH₄OH → HNO₃ hot.)

However, native oxides are not removed by
photoexcited dry cleaning using chlorine.
Figure 4 shows the secondary ion mass spec- 
trometry (SIMS) depth profiles of oxygen and 
chlorine in cleaned samples onto which a poly- 
silicon layer was deposited”. The figure shows 
that large numbers of oxygen and chlorine 
atoms accumulate at the silicon-polysilicon 
interface. The concentration of oxygen at the 
interface is estimated to be 10? cm™°. Since the 
partial pressures of both oxygen and water are 
less than 10⁻⁵ Pa in the chamber, such a high
oxygen concentration cannot be caused by a 
mechanism such as oxidation after cleaning. 
Thus, we conclude that the native oxide remains 
during photoexcited cleaning. From Auger 
analysis, it was found that the hydrocarbon 
residue’s signal was smaller than the signal after 
wet cleaning”. 

Despite the native oxide on the surface, 
etching is not delayed (see Fig. 3). The photo- 
excited chlorine radicals probably diffuse 
through the thin native oxide and react with 
silicon atoms to produce volatile species like 
SiCl, and SiCl,. Most likely, the chlorine radicals 
react with contaminants in the native oxide and 
with contaminants near the silicon surface. 
Reaction products diffuse out through the 
native oxide and are discharged. 

The process by which a silicon wafer is
etched by photoexcited dry cleaning greatly
depends on the surface conditions. Differences
were observed between surfaces after various
wet treatments followed by photoexcited dry
cleaning that were not observed just after the wet
treatments. Figure 5 shows the surface mor-
phology after seven types of wet treatment
followed by photoexcited dry cleaning as
observed by transmission electron microscope
(TEM)??. The process parameters of the wet
treatments are shown in Table 1. All wafers were
treated with diluted HF and rinsed in deionized
water before wet treatment, and were rinsed
in deionized water after wet treatment. Treat-
ments using NH₄OH, HCl, or H₂SO₄ produce
very uniform silicon surfaces after photoexcited
dry cleaning. This contrasts with the surface
after HNO₃ treatment. While treatment with
HNO₃ roughens the surface, the surfaces after
treatment with NH₄OH and HNO₃ are smooth.
Treatment using HF solution causes long periods
of roughening.

Table 1. Process parameters of wet cleaning shown in Fig. 6

No.  Solution            Composition                   Temp. (°C)  Time (min)
1    HF                  3 HF : 100 H₂O                …           …
2    NH₄OH               1 NH₄OH : 1.4 H₂O₂ : 4 H₂O    63-80       10
3    HCl                 1 HCl : 1 H₂O₂ : 4 H₂O        37-65       10
4    H₂SO₄               4 H₂SO₄ : 1 H₂O₂              90-85       10
5    HNO₃ hot            HNO₃                          45-60       5
6    HNO₃ boil           HNO₃                          115-125     5
7    NH₄OH → HNO₃ hot    No. 2 → No. 5

Fig. 6—Native oxide thicknesses and photoexcited
etching rates of various types of wet cleaning 3).
(Axes: etch rate (nm/min) and native oxide thickness (nm)
for HF, NH₄OH, HCl, H₂SO₄, HNO₃ hot, HNO₃ boil, and
NH₄OH → HNO₃ hot.)

Figure 6 shows the native oxide thicknesses 
and photoexcited etching rates of various types 
of wet cleaning. The thicknesses were measured 
using an ellipsometer”?. The surface morphology 
after photoexcited dry cleaning reflects not only 
the thickness of the native oxides after wet 
cleaning but also the film quality. While the 
native oxide after HNO₃ cleaning is only slightly
thicker than after NH₄OH, HCl, or H₂SO₄
cleaning, surfaces treated with HNO₃ differ
from surfaces treated with other solutions.
After HNO₃ treatment, very thin native oxides
disturb the chlorine radical’s diffusion into the
surface, causing etch pits to form on the wafer’s
surface.

The chemical structures of the native oxides 
formed during the wet chemical treatments 
described above were investigated by non- 
destructive measurements of the Si 2p photo- 
electron spectra. We could not determine the 
native oxide structures, only the amounts of 
suboxides in the native oxides. There was a 
clear difference in the distribution of Si?* , while 
the amounts of suboxides were almost the 
same for all the treatments investigated. Treat- 
ment with HNO; resulted in localization of Si** 
at the silicon-oxide interface!®. Si** is correl- 
ated with surface roughness produced by photo- 
excited dry cleaning. 


4. Estimation of cleanliness 

The residual concentrations of various 
elements on the silicon surface were examined 
by atomic absorption spectrophotometry (AAS). 
The native oxide on the cleaned surface was 
vapor etched by HF, and the contaminated HF 
solution was analyzed. Photoexcited dry clean- 
ing by chlorine was carried out using a clean 
quartz chamber at 150 °C and 2.7 kPa. Special
chlorine gas, 99.999 percent pure, was required. 
Figure 7 compares the surface cleanliness
after wet cleaning alone (i.e., diluted HF,
NH₄OH/H₂O₂/H₂O, hot HNO₃ cleaning, and
deionized water rinsing) with the cleanliness
after wet cleaning followed by etching to a
depth of 60 nm by photoexcited dry cleaning!”.
The figure shows that the etching reduced the
concentrations of Fe, Mg, Ca, and Na. Both wet
and photoexcited dry cleaning left no detectable
traces of other elements such as Ni, Cr, Al, and Cu.

Fig. 7—Surface cleanliness after wet cleaning and after
wet cleaning followed by etching to 60 nm by
photoexcited dry cleaning!). The silicon was
not intentionally contaminated. (Axes: concentration
(atoms/cm²) for wet cleaning and photoexcited dry cleaning.)

SIMS was used to observe metal contami- 
nants on the surfaces of these wafers after 
silicon epitaxy. Parts of the wafers were etched 
to a depth of 20 nm by photoexcited dry clean- 
ing. Then, hydrogen gas annealing was carried 
out at 980 °C for 15 min, followed by epitaxial 
silicon deposition at 800 °C in the chamber used 
for photoexcited dry cleaning. Only K, Na, and 
Al atoms were detected in a search for K, Na, 
Al, Fe, Ni, Co, and Cu at the interface (see
Fig. 8). The K atoms were removed by photo-
excited dry cleaning. In addition, Na also fell
below the limit of detection after photoexcited 
dry cleaning. 

Since the chlorides of K, Na, Mg, and Ca are 
less volatile, and since these metals and Fe were 
removed, we postulated the lift-off cleaning 
model described in the previous chapter. 

Intentionally contaminated wafers were used 
to clarify the effect of cleaning on particular 
elements. The cleaning characteristics for Fe, the 
most abundant element contaminating ULSI


[Fig. 8 plot: ion counts (arb. units) for thermal treatment only, and for photoexcited dry cleaning followed by thermal treatment]

Fig. 8—Effects of thermal treatment at 980 °C with and
without prior photoexcited cleaning. Following
the thermal treatment, a 1 µm epitaxial layer was
grown at 800 °C in Si₂H₆ ²⁾.

Fig. 9—Fe concentration on silicon surfaces before and
after photoexcited dry cleaning¹²⁾.
Surfaces were contaminated by immersion in a
NH₄OH/H₂O₂/H₂O solution containing 50-ppb
Fe and then etched by photoexcited dry cleaning.

Fig. 10—XPS spectrum of Fe 2p3/2 region for silicon
surface with Fe contaminants¹²⁾.


processes, was examined. After HF pretreatment, 
wafers were contaminated by immersion in a 
NH₄OH/H₂O₂/H₂O solution containing Fe. The
wafers were then cleaned by photoexcited dry 
cleaning. The Fe concentration was analyzed by 
AAS. Figure 9 shows the dependence of Fe 
concentration on the etch depth for photoexcited
dry cleaning¹²⁾. The figure shows that
the surface concentration of Fe is lowered by 
silicon etching and is temperature dependent. 
At 170 °C, the Fe concentration was lowered to 
about one percent of its initial value, and was 
below the detection limit. At 140°C, the 
Fe concentration was only lowered to about 
10 percent of its initial value. 

The Fe 2p3/2 spectrum for a contaminated
silicon surface as measured by X-ray photoelectron
spectroscopy (XPS) is shown in Fig. 10¹²⁾.
The Fe peak, ordinarily at 706 eV, has been
shifted up by 5 eV to 711 eV, indicating that the
contaminating iron atoms are bonded to oxygen
atoms in the native oxide. Since the iron is removed by
photoexcited dry cleaning, as shown in Fig. 9,
the iron in the native oxide is easily unbound.


5. Electrical Properties 

Metal contaminants remaining on the silicon 
surface degrade the electrical characteristics of 
fabricated devices, and decrease carrier lifetimes 
in the silicon. These contaminants can be 
attributed to resist contamination and process 
environment contamination. 


5.1 Dielectric characteristics of oxides 

Dielectric breakdown fields are greatly 
influenced by metal contaminants!*’!”., Time 
zero breakdown fields and the time dependent 
dielectric breakdown (TDDB) characteristics of 
oxides were investigated after photoexcited 
cleaning. 

MOS capacitors were fabricated on (100)- 
oriented p-silicon wafers having a resistivity of 
10 Ω·cm. The wafers were wet-cleaned using
diluted HF treatment, NH₄OH/H₂O₂/H₂O and
hot HNO₃ cleaning, and deionized water rinsing.
The wafers were then dry-cleaned by photo- 
excitation. Some wafers were only wet-cleaned. 
The silicon was etched to a depth of 30 nm by 
the photoexcited dry cleaning. A gate oxide of 
16 nm was grown at 1 000 °C in dry oxygen and 
aluminum electrodes were deposited by evapora- 
tion. Figures 11 a) and 11 b) show histograms of 
the dielectric breakdown fields of the MOS 
structures when a negative bias was applied to 
the electrodes and the area of each electrode was 
0.16 cm². The breakdown field was defined as
the field induced when the gate current density 
reached 3x10 A/cm? !". The dielectric 
breakdown distribution of dry-cleaned samples 
was sharper than that of wet-cleaned samples 
and the field strength was higher. The increase 
near the intrinsic breakdown value after wet 
cleaning is due to metallic contamination. The 

Fig. 11—Histograms of dielectric breakdown fields for
MOS structures having a 16 nm oxide layer!) 

Fig. 12—TDDB characteristics of the oxide layer.


peak value of the breakdown field after photo- 
excited dry cleaning exceeded 10 MV/cm. 

Fig. 13—Quasi-static C-V curves¹⁵⁾.

Figure 12 shows the results of TDDB measurements
when a constant field of −10 MV/cm was
applied to the gate. The figure shows that 
photoexcited dry cleaning increased the life- 
times of oxide films. Improvements in the oxide 
TDDB characteristics after photoexcited dry 
cleaning are also due to the removal of metal
contaminants from the surface. 
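
The breakdown and stress fields quoted above can be related to the voltage across the oxide with a simple thickness scaling. The sketch below is illustrative only: the function names are hypothetical, and it assumes the field is the plain ratio of oxide voltage to oxide thickness, ignoring flat-band voltage and band bending in the silicon.

```python
def oxide_field_mv_per_cm(oxide_voltage_v, oxide_thickness_nm):
    """Average oxide field in MV/cm (voltage across the oxide / thickness)."""
    thickness_cm = oxide_thickness_nm * 1e-7   # 1 nm = 1e-7 cm
    return oxide_voltage_v / thickness_cm / 1e6


def oxide_voltage_for_field(field_mv_per_cm, oxide_thickness_nm):
    """Voltage across the oxide (V) needed to reach a given average field."""
    return field_mv_per_cm * 1e6 * oxide_thickness_nm * 1e-7


# 16 nm gate oxide stressed at 10 MV/cm, as in the TDDB measurement.
v_stress = oxide_voltage_for_field(10.0, 16.0)
print(f"about {v_stress:.1f} V across a 16 nm oxide gives 10 MV/cm")
print(f"check: {oxide_field_mv_per_cm(v_stress, 16.0):.1f} MV/cm")
```

Under this simplification, a 10 MV/cm field in the 16 nm oxide corresponds to roughly 16 V across the oxide.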


5.2 Silicon-oxide interface characteristics 

The flat-band voltage and the interface state 
density of aluminum-gate MOS capacitors having 
a 16-nm-thick oxide layer were estimated from 
high-frequency and quasi-static C-V curves.
Fowler-Nordheim carrier injection was performed
using a negative bias applied to the gate
electrode and a constant current. The current
density was 2.6 × 10⁻⁵ A/cm² and the injection
time was 500 s.

Before Fowler-Nordheim injection, there is
only a small difference between the high-frequency
and quasi-static C-V curves for wet cleaning and 
wet cleaning followed by photoexcited dry 
cleaning. However, a clear difference appears 
after Fowler-Nordheim injection. Figure 13 a) 
shows the quasi-static C-V curves after wet 
cleaning, and Fig. 13 b) shows the curves after 
wet cleaning followed by photoexcited dry 
etching to a depth of 6 nm!®?. The shift in the 
flat-band voltage V_FB during Fowler-Nordheim
injection stress decreased to 1/5 of its original
value after the photoexcited dry cleaning. The
mid-gap interface state density D_it after
Fowler-Nordheim injection also decreased from
8x10" cnr eV «to 2.5% 10*? env? ey” 
after the photoexcited dry cleaning. 

The surface generation velocity was calculated
from MOS Zerbst plots; a decrease in this velocity
improves the Fowler-Nordheim injection characteristics.
The C-t curves were measured at
50°C. Figure 14 shows the dependence of the 
surface generation velocity!» on the etching 
depth. The scatter in the velocities also decreased
after etching to a depth of 60 nm by photoexcited
dry cleaning.


5.3 Carrier lifetime and junction leakage 
characteristics 
The carrier generation lifetimes were evaluated
as a function of the silicon etching depth.
MOS structures having a 20 nm

Fig. 14—Dependence of surface generation velocities on
photoetching depth¹¹⁾.

gate oxide layer and aluminum electrodes were
fabricated on (100)-oriented n-type silicon
wafers having a resistivity of 1 Ω·cm. The lifetimes
were derived from the Zerbst plots at
50°C. The lifetime increased to 2.5 ms for an 
etching depth of 50 nm (see Fig. 15)*. After 
conventional wet cleaning the lifetime was 
1.6 ms. The longer lifetime improves the junction 
leakage characteristics and is due to decreases in 
the amount of heavy metal contaminants. 
Numerous metal contaminants adhered to 
the silicon surface during dry etching!). The 
junction leakage characteristics after photo- 
excited cleaning were estimated for the samples 
fabricated by dry etching a field oxide layer. 
The n⁺p junctions were fabricated on (100)-
oriented p-silicon wafers having a resistivity of
10 Ω·cm. The 600-nm field oxide layer was
etched by reactive ion etching (RIE) using a
50% CF₄/50% CHF₃ gas mixture. Conventional RCA
wet cleaning was then done. The cleaning consisted
of washing in NH₄OH/H₂O₂/H₂O, diluted
HF, and HCl/H₂O₂/H₂O, and then immersion in
NH₄OH/H₂O₂/H₂O after RIE of the oxide layer.
After wet cleaning, some wafers were cleaned by
photoexcited dry cleaning in a 2.7 kPa chlorine
atmosphere at 170 °C. The silicon was etched to
a depth of 47 nm, then a 5 nm oxide layer was
grown at 800 °C in dry oxygen. Arsenic was
implanted through the oxide layer at 30 keV
to a dose of 4 × 10¹⁵/cm². The silicon

Fig. 15—Dependence of carrier generation lifetime on
photoetching depth®). 

Fig. 16—Reverse I-V characteristics of n⁺p junction for
RIE samples. Cleaning was performed prior to 
oxidation and arsenic ion implantation!®, 


substrates were annealed at 1 000 °C for 20 min 
in a nitrogen atmosphere to activate the arsenic. 
Then, the oxide films were removed using a 
diluted HF solution, and aluminum electrodes 
containing silicon were sputter-deposited. The 
wafers were annealed at 450 °C for 30 min in a 
forming gas (N₂/H₂).

Figure 16 shows the reverse I-V characteristics
of the n⁺p junctions after wet cleaning
and after wet cleaning followed by photoexcited 
dry etching to a depth of 47 nm for a junction 
area of 1.13 x 10°? cm? '®. Stray currents 
often appeared after wet cleaning, but did not 
appear after photoexcited dry etching. The 
temperature dependence of leakage currents was 
measured from 10°C to 100°C at 5 V reverse 
bias. Figure 17 shows Arrhenius plots of the 
temperature dependence of the junction leakage 
currents for RIE samples after wet cleaning, and 

Fig. 17—Temperature dependence of junction leakage
currents at reverse bias of 5 V for RIE 
samples!®, 

Fig. 18—SIMS depth profiles of Ni in RIE samples!®


after wet cleaning followed by photoexcited dry 
cleaning. From the slope of the curves, we 
suspect that the diffusion currents control the 
leakage current above 40°C, and that the 
currents generated by metal contaminants in the 
depletion layer control the leakage current 
below 40°C!”. The currents generated after 
wet cleaning were larger and more scattered
than those generated after photoexcited dry
cleaning. This suggests that photoexcited dry
cleaning removes many of the metal contami- 
nants. 
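
The slope analysis of an Arrhenius plot can be sketched as follows. This is an illustrative example and not the authors' procedure: the data are synthetic, the function names are hypothetical, and the two activation energies (about 1.12 eV for diffusion-limited current and about half of that for generation-limited current in silicon) are commonly quoted textbook values that do not appear in the source.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333262e-5


def activation_energy_ev(temps_c, currents_a):
    """Least-squares slope of ln(I) versus 1/T, returned as an energy in eV."""
    x = [1.0 / (t + 273.15) for t in temps_c]
    y = [math.log(i) for i in currents_a]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    slope = (sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
             / sum((a - mean_x) ** 2 for a in x))
    return -slope * BOLTZMANN_EV_PER_K


def synthetic_current(temp_c, ea_ev, prefactor=1.0):
    """Synthetic thermally activated current, used only to exercise the fit."""
    return prefactor * math.exp(-ea_ev / (BOLTZMANN_EV_PER_K * (temp_c + 273.15)))


high_t = [40, 60, 80, 100]   # degrees C, regime attributed to diffusion current
low_t = [10, 20, 30, 40]     # degrees C, regime attributed to generation current
ea_high = activation_energy_ev(high_t, [synthetic_current(t, 1.12) for t in high_t])
ea_low = activation_energy_ev(low_t, [synthetic_current(t, 0.56) for t in low_t])
print(f"high-temperature slope: {ea_high:.2f} eV")
print(f"low-temperature slope:  {ea_low:.2f} eV")
```

With measured leakage data in place of the synthetic currents, a change in the fitted energy between the two temperature ranges would indicate the kind of crossover described above.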

The depth distribution of contaminants was 
measured by SIMS. No clear trace of aluminum
was found, so it must have been removed by the
hot HNO₃ prior to SIMS analysis. A search was
made for contaminants such as Fe, Ni, Cr, and
Cu, but only Ni was detected. Figure 18 shows
the depth profiles of Ni. After wet cleaning, Ni
was detected within 30 nm of the silicon sur- 
face!®. By comparing these samples with the 
control samples, the peak concentration of Ni 
was found to be about 5 × 10¹⁸ atoms/cm³.
The Ni contaminants were removed by photo- 
excited dry cleaning, suggesting that photo- 
excited dry cleaning is sufficient to remove all 
metal contaminants. (High-sensitivity apparatus
is required to estimate surface cleanliness.)

It seems that the large currents found after 
wet cleaning were caused by metal contaminants 
adsorbed by the silicon substrates during dry 
etching. During dry etching, F and C ions 
strike the stainless-steel chamber wall displacing 
metals such as Fe, Ni, and Cr. These contami- 
nants are not eliminated by RCA wet cleaning. 
Photoexcited dry cleaning, however, does 
remove the contaminated layer and therefore 
reduces the leakage currents. Thus, clean silicon 
surfaces can be obtained even if the silicon is
contaminated by RIE. Both the temperature 
dependence and SIMS analysis supported this 
hypothesis. 


5.4 Epitaxy 

In silicon epitaxy, etching with HCl and H₂
gas prior to silicon epitaxial growth is often used 
to remove contaminants and native oxides and 
to provide a clean silicon surface. However, this 
method requires a high temperature (about 
1100 °C) to liberate chlorine atoms from HCl 
molecules. Such a high temperature makes 
removal of alkali- and heavy-metal contaminants 
having large diffusion constants nearly impos- 
sible. Photoexcited dry cleaning can be used to 
etch a silicon surface just prior to epitaxy. 

Boron-doped p-silicon wafers having a resistivity
of 15 Ω·cm and a (100)-oriented surface
were cleaned by conventional RCA washing
ending in HCl/H₂O₂/H₂O treatment and deionized
water rinsing prior to photoexcited dry
cleaning. Some of the wafers were etched to a
depth of 20 nm during photoexcited dry cleaning
at a chlorine pressure of 2.7 kPa at 150 °C.
Following photoexcited dry cleaning, the thin
native oxide and the volatile chloride compounds
on the wafers were removed by high-temperature
treatment. This treatment was
carried out in the photoexcited-cleaning
chamber at between 800 °C and 980 °C in
hydrogen at 27 kPa for 15 min. Following thermal
treatment, a 1.0 µm epitaxial p-silicon layer
was grown at 800 °C in Si₂H₆, B₂H₆, and H₂
gases at 27 kPa.

Fig. 19—Chlorine concentrations at the interface and in
the deposited film as a function of the thermal
treatment temperature²⁾. Chlorine remaining
after photoexcited dry cleaning is eliminated at
900 °C by treatment in hydrogen at 27 kPa.

Fig. 20—Oxygen concentrations at the interface and
in the deposited film²⁾. Oxygen is removed
from the surface at 980 °C, a necessity for
epitaxial growth.

The native oxide layer containing residual
chlorine remained on the silicon surface after
photoexcited dry cleaning (see Fig. 4). This
layer must be removed to obtain a clean silicon
surface for epitaxy. Figure 19 shows that the
chlorine concentration at the interface is minimized
at 900 °C. Figure 20 shows that the
oxygen concentrations at the interface and in the
epitaxial film are related and are both minimized
at 980 °C²⁾. Thus, hydrogen-ambient thermal
treatment at 980 °C eliminates volatile chloride 
compounds and the native oxide layer. The 
silicon epitaxial layer was only deposited after 
thermal treatment at 980 °C. An epitaxial wafer 
was cut along the (110) surface and Secco- 
etched for 20s. The cross section was then 
observed by SEM. We observed no etch pits
caused by crystal defects at the interface.

Epitaxial wafers were oxidized and an aluminum
layer was deposited during the fabrication
of MOS capacitors. The breakdown fields of the 
20 nm oxide in the accumulation region were 
measured by ramp I-V analysis. The gate area 
was 9 x 10°? cm?. Figure 21 a) shows the distri- 
bution of the dielectric breakdown field after 
thermal treatment at 980°C. Figure 21 b) 
shows the distribution after photoexcited dry 
cleaning followed by thermal treatment²⁾. The
figures show that photoexcited dry cleaning 
improves the breakdown field and eliminates 
the increase near the intrinsic breakdown value. 
The average breakdown fields were 10.2 MV/cm 
after thermal treatment only, and 11.5 MV/cm 
after the improved process. Because, as was 
previously mentioned, the contaminants were 
removed by photoexcited dry cleaning (see 
Fig. 8), the elimination of the increase near the 
intrinsic breakdown value must be due to the 
decreased incorporation of metal contaminants 
into the oxide. Thus, the new photoexcited dry 
cleaning procedure is an effective way to 
produce epitaxial silicon that is free of metal 
contaminants.


6. Conclusion 

Photoexcited dry cleaning using chlorine can 
provide ultraclean silicon surfaces, and can 
control the surface conditions prior to film 
deposition. The quality of the oxide layers, 
silicon substrates, epitaxial films, and their 
interfaces is improved after photoexcited dry
cleaning. The cleaning mechanism, the optimiza- 
tion of the cleaning parameters that affect the 
surface, the possibility of fully-automated 
cleaning units, and the methods of evaluating 
cleanliness should be further investigated. 


Laser light sources and receivers for the next generation of optical communication systems
currently being developed in Fujitsu Laboratories Ltd. are reviewed. A modulator integrated 
DFB laser and APD for high-speed systems, and a multielectrode DFB laser and balanced 
receivers for coherent communication systems are described. 


1. Introduction 

In the last decade, much progress has been 
made in the development of laser light sources 
and detectors. Highly reliable Fabry-Perot 
lasers and single longitudinal mode distributed 
feedback (DFB) lasers with a large modulation 
bandwidth have been developed. Avalanche 
photodiodes (APDs) with a gain bandwidth 
(GB) product of up to 30-40 GHz are now 
available. This progress has made it possible
to put trunk line systems running at 2 Gbit/s 
into service. Furthermore, the development of 
highly reliable lasers has allowed installation 
of undersea cable systems. 

This has prompted discussion on the next 
generation of systems which will provide a 
variety of services and will be accessible to 
subscribers from the home or office. An exam- 
ple of such a system is the Broadband Integrated 
Service Digital Network (B-ISDN) from NTT. 
Development will start in 1995 and will eventually
extend to a nation-wide fiber-to-the-home
system by the year 2015¹⁾. In such a
system, the network of customer premises is 
likely to run at between 100 Mbit/s and several 
Gbit/s, while the required capacity for the 
public access network is estimated to be 
100 Gbit/s²⁾. The trunk line may even require
terabit-per-second capacity.

One way to realize such high transmission
capacities is to increase the modulation bit-rate
of intensity modulation and direct detection
(IM/DD) to a maximum, then perform
wavelength division multiplexing (WDM).
Present electrical multiplexing technology would 
limit the maximum bit-rate to around 10 Gbit/s. 
The other attractive possibility is coherent 
transmission, which allows very long distance 
transmission and very dense optical frequency 
division multiplexing; this gives a huge transmis- 
sion capacity and a multichannel network 
accessible by customers. Optical frequency 
division multiplexing (FDM) of 100 channels, 
each running at 2 Gbit/s with 10 GHz (0.08 nm) 
spacing, will combine to give a capacity of 
200 Gbit/s. 
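
The capacity and channel-spacing figures above can be checked with a short calculation. This is a minimal sketch: the function names are hypothetical, and the 1.55 µm operating wavelength is an assumption taken from the DFB laser wavelength quoted later in this review.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458.0


def aggregate_capacity_gbit_s(channels, rate_per_channel_gbit_s):
    """Total capacity of equally loaded multiplexed channels."""
    return channels * rate_per_channel_gbit_s


def frequency_to_wavelength_spacing_nm(freq_spacing_hz, wavelength_m):
    """Small-spacing conversion: d(lambda) = lambda**2 * d(f) / c, in nm."""
    return wavelength_m ** 2 * freq_spacing_hz / SPEED_OF_LIGHT_M_PER_S * 1e9


capacity = aggregate_capacity_gbit_s(100, 2.0)
spacing = frequency_to_wavelength_spacing_nm(10e9, 1.55e-6)
print(f"capacity: {capacity:.0f} Gbit/s")
print(f"10 GHz near 1.55 um is about {spacing:.3f} nm")
```

At 1.55 µm a 10 GHz spacing works out to about 0.08 nm, which agrees with the value in parentheses in the text.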

Enormous efforts will have to be made on 
device development if such systems are to be 
realized. Much greater performance and new 
functions are needed from the light sources and 
receivers. Low cost devices satisfying stringent 
requirements are also essential for the low bit- 
rate customer network. 

In this review, we focus on the light
sources and receivers for very high bit-rate
IM/DD systems and coherent transmission
systems. The following describes achievements
made so far and touches on problems yet to be
resolved. A modulator integrated DFB (MI-DFB)
laser³⁾ and a high-speed APD⁴⁾ for high-speed
IM/DD systems, and a new multielectrode
DFB laser⁵⁾,⁶⁾, balanced detectors⁷⁾ and an integrated
coherent receiver⁸⁾ for coherent systems
are discussed.


2. Devices for high-speed intensity modulation 
systems 
2.1 Background 

Research and development of semiconductor
lasers has been directed toward attaining high-speed
single mode operation. So far, a single mode
DFB laser with 3 dB bandwidth of 15 GHz has
been achieved⁹⁾,¹⁰⁾. However, high-speed direct
modulation of the DFB laser causes wavelength
chirp, and this limits transmission distance
due to dispersion in the fiber. It has been
estimated that the maximum fiber dispersion over
which we can transmit with a penalty of less than
0.5 dB is 200 ps/nm under 3 Gbit/s modulation¹¹⁾. This
corresponds to about 70 km for typical zero
dispersion fiber, and to only 10 km for
non-dispersion shifted fiber at 1.55 µm. The figures
are even more critical at 10 Gbit/s; for 10 Gbit/s
modulation, wavelength chirp reaches 1 nm
(full width at −15 dB).
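
The link between a dispersion tolerance in ps/nm and a transmission distance is a simple division by the fiber's dispersion coefficient. The sketch below is illustrative: the function names are hypothetical, and the coefficients it prints are back-calculated from the distances quoted above rather than taken from the source.

```python
def reach_km(tolerance_ps_per_nm, fiber_dispersion_ps_per_nm_km):
    """Distance at which accumulated dispersion reaches the tolerance."""
    return tolerance_ps_per_nm / fiber_dispersion_ps_per_nm_km


def implied_dispersion_ps_per_nm_km(tolerance_ps_per_nm, distance_km):
    """Fiber dispersion coefficient implied by a tolerance and a distance."""
    return tolerance_ps_per_nm / distance_km


TOLERANCE = 200.0  # ps/nm, the 0.5 dB penalty limit quoted in the text

d_shifted = implied_dispersion_ps_per_nm_km(TOLERANCE, 70.0)
d_standard = implied_dispersion_ps_per_nm_km(TOLERANCE, 10.0)
print(f"zero dispersion fiber:        about {d_shifted:.1f} ps/(nm km)")
print(f"non-dispersion shifted fiber: about {d_standard:.1f} ps/(nm km)")
print(f"reach at 20 ps/(nm km): {reach_km(TOLERANCE, 20.0):.0f} km")
```

Read this way, the quoted distances correspond to roughly 3 ps/(nm km) and 20 ps/(nm km) for the two fiber types.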

To facilitate transmission over 50-100 km at
10 Gbit/s using typical zero dispersion fiber, the
wavelength chirp should be below 0.1 nm¹¹⁾.
One way to reduce the chirp is to introduce a
multiple quantum well (MQW) active layer. The
MQW active layer reduces the refractive index
change under high-speed modulation and results
in a smaller wavelength chirp. However, chirp is
still about 0.5 nm¹⁰⁾. We have therefore taken a
different approach and introduced external
modulation. We developed a monolithic light
source by integrating a modulator and a DFB
laser into one chip, i.e. a modulator integrated
DFB (MI-DFB) laser³⁾.

On the detector side, an InGaAs/InP APD
has been developed to take advantage of its
multiplication gain which gives high sensitivity
without the thermal noise of an electrical
amplifier¹²⁾. It is necessary to have a large GB
product with a bandwidth close to 10 GHz.
However, in the InGaAs/InP APD, this is difficult
as holes pile up at the heterojunction¹³⁾.
A large GB could only be obtained at a large
multiplication factor, and so the bandwidth
was narrow. We solved this problem by introducing
a graded layer at the heterointerface.
We also optimized the field distribution further
to increase the GB product.


2.2 Modulator integrated DFB laser 

Figure 1 shows a schematic of the MI-DFB
laser; a Franz-Keldysh modulator was used. The
DFB laser wavelength is 1.55 µm. For this wavelength,
the composition of the absorption layer
was selected to correspond to 1.43 µm. When a reverse
bias voltage is applied to the modulator, the
absorption edge shifts to longer wavelengths
and the laser light is modulated by absorption.
The composition of the absorption layer was
adjusted to allow a smaller operating voltage
with less insertion loss.

Fig. 1—Modulator integrated DFB laser.
(Labels: DFB laser, active layer.)

Fig. 2—Current versus light output characteristics.

There are three key points to note 
when producing a high-performance integrated 
device. The first one concerns electrical 
isolation between the forward biased DFB 
laser and the reverse biased modulator. The 
isolation resistance must be high, and to achieve 
this, a semi-insulator embedded buried heterostructure
(BH) was used. This raised the isolation
resistance to more than 100 kΩ.

The second point concerns the optical 
coupling between the DFB laser and the external 
modulator. To achieve a high coupling effi- 
ciency, a new layer structure was used, with a 
thin InP layer and a guiding layer, common both 
for the DFB laser and the modulator. The 
butt-joint was formed by liquid phase epitaxy 
(LPE) on a thin InP layer. This layer structure 
gives a high coupling efficiency of about 80 
percent. 

The third point concerns the anti-reflection 
(AR) coating on the modulator output facet. 
Light reflected at this facet causes unstable 
operation of the DFB laser. Reflectivity was 
reduced below 0.1 percent by using an AR 
coating with SiN. At the DFB laser facet, a 
high reflectivity (HR) coating with reflectivity 
of about 80 percent was used to increase light 
output from the modulator. 

Figure 2 shows light output power versus 
current. As voltage applied to the modulator 
increases, light output decreases, until the 
reduction saturates at −3 V. There is still some
output power at this voltage, due to scattered
light at the butt-joint. When the output power
was coupled to a single mode fiber, an extinction
ratio of −13 dB for an applied voltage
of −3 V was attained.
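
To put the extinction ratio in linear terms, the decibel figure can be converted to a power ratio. This is a minimal sketch with hypothetical function names; it assumes the extinction ratio is defined as the off-state power divided by the on-state power, expressed as 10·log10 of the ratio.

```python
import math


def db_to_power_ratio(value_db):
    """Convert a decibel value to a linear power ratio."""
    return 10.0 ** (value_db / 10.0)


def power_ratio_to_db(ratio):
    """Convert a linear power ratio to decibels."""
    return 10.0 * math.log10(ratio)


off_fraction = db_to_power_ratio(-13.0)
print(f"off-state power is about {off_fraction * 100:.1f} % of the on-state power")
print(f"round trip: {power_ratio_to_db(off_fraction):.1f} dB")
```

Under that definition, an extinction ratio of −13 dB means about 5 percent of the on-state power still reaches the fiber in the off state.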

Figure 3a) shows the eye pattern for 
10 Gbit/s NRZ modulation, observed by a PIN 
photodetector; a clear eye opening was pro- 
duced. Figure 3b) shows the spectrum for the 
same modulation. The full width at −15 dB of
the maximum was 0.032 nm, which is drastically 
reduced when compared with direct modula- 
tion of a DFB laser. 

Fig. 3—A 10 Gbit/s NRZ modulation.
a) Eye pattern (50 ps scale; 10 Gbit/s, 130 mA).
b) Time averaged spectrum (full width 0.032 nm; intensity, 5 dB/div; wavelength, 0.016 nm/div).


2.3 High-speed APD with optimized field 
profile and graded heterointerface 
The inset of Fig. 4 shows the layer structure
of the newly developed APD: an InP
buffer layer, an InGaAs absorption layer, a
compositionally graded undoped layer, an
n⁺-InP electric field falling layer, and an InP
window layer. The thickness of the InP multiplication
layer was controlled by the depth of
the p⁺ layer formed by Cd diffusion. An ion-implanted
Be guardring was used to make the planar junction.
The graded layer stops holes piling up, and
the n⁺ field falling layer prevents impact ionization
in the InGaAs absorption layer.

Fig. 4—Multiplication factor versus bias voltage.
Inset shows APD layer structure. 

Fig. 5—Cutoff frequency versus multiplication factor.


Figure 4 shows multiplication factor as a 
function of bias voltage. The breakdown voltage 
was less than 40 V and at 90 percent of the 
breakdown voltage, the dark current was less 
than 50 nA. 

Figure 5 shows −3 dB cutoff frequency
versus multiplication factor for two samples
with multiplication layer thicknesses of 0.24 µm
and 0.32 µm. The electric field drop at the
n⁺-InP layer is 5 × 10⁵ V/cm. For the 0.32 µm
multiplication layer, the GB product was
70 GHz and the maximum cutoff frequency
was 9 GHz.
Even for a small multiplication factor, the
bandwidth was large, and this was due to the 
graded layer which prevented hole pile up. 
For a thicker multiplication layer, the GB 
product is large since the overall electric field 
of the diode is lower; GB product increases 
due to avalanche suppression in the InGaAs 
absorption layer. 
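
The trade-off between gain and bandwidth can be illustrated with a simplified two-regime model. This is an assumption made for illustration, not the authors' device model: the cutoff frequency is taken to be flat at its maximum value for small multiplication factors and equal to the GB product divided by the multiplication factor once that becomes the tighter limit. The function names are hypothetical.

```python
def cutoff_frequency_ghz(multiplication, gb_product_ghz=70.0, max_cutoff_ghz=9.0):
    """Cutoff frequency under the simplified model min(f_max, GB / M)."""
    return min(max_cutoff_ghz, gb_product_ghz / multiplication)


def crossover_multiplication(gb_product_ghz=70.0, max_cutoff_ghz=9.0):
    """Multiplication factor above which the GB product limits the bandwidth."""
    return gb_product_ghz / max_cutoff_ghz


for m in (2, 5, 10, 20):
    print(f"M = {m:2d}: cutoff about {cutoff_frequency_ghz(m):.1f} GHz")
print(f"GB-limited above M of about {crossover_multiplication():.1f}")
```

With the measured 70 GHz GB product and 9 GHz maximum cutoff, this model keeps the full bandwidth up to a multiplication factor of roughly 8.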


2.4 Discussion 

The MI-DFB laser exhibited a very low chirp
of 0.032 nm and good eye opening at 10 Gbit/s
modulation; an output power of 10 mW was
achieved. Using this monolithic device, transmission
over a dispersion of 230 ps/nm at 10 Gbit/s
with a penalty less than 0.5 dB is possible¹¹⁾. At
the lower bit-rate of 2.4 Gbit/s, the limit exceeds
2000 ps/nm¹¹⁾. However, the operating
voltage still remains rather high at −3 V. This
operating voltage must be reduced for modulation
using an IC driver. It will be possible to
reduce it to −2 V by optimizing the composition
of the absorption layer further, and by
increasing the power of the DFB laser. The APD
with a graded layer and an optimized field
profile gave a GB product of 70 GHz. The large
9 GHz bandwidth enables this device to be used
for 10 Gbit/s systems. Further optimization
will make possible a GB product of
100 GHz.

The combination of the MI-DFB laser and
the APD will enable transmission over about 70 km
at 10 Gbit/s using a typical zero dispersion
fiber. This is a very great improvement over the
direct laser modulation and conventional APD
system¹²⁾. However, the MI-DFB should be
compared with a lithium niobate (LN) Mach-Zehnder
modulator¹⁴⁾. The LN modulator can,
in principle, reduce wavelength chirp to zero,
and transmit over several hundred kilometers 
using fiber amplifiers. But the MI-DFB laser is 
very much smaller and easier to use, by virtue of 
its monolithic integration of the modulator and 
the DFB laser. The MI-DFB laser can be a very 
practical high-speed light source. 


3. Devices for coherent systems
3.1 Background 

Lasers for coherent communication systems 
must satisfy a variety of requirements. The laser 
must be tunable so that the wavelength of the 
transmitter and the local oscillator can be 
adjusted for heterodyne detection. A very 
narrow linewidth is also required. For example,
for continuous phase frequency shift keying
(CPFSK), a linewidth less than 10⁻³ times the
modulation bit-rate is necessary¹⁵⁾. Therefore,
for a bit-rate of 1 Gbit/s, lasers with a linewidth
of less than 1 MHz must be made. Furthermore,
these lasers must have a flat FM response from
100 kHz to several times the modulation bit-rate
to get a clear eye opening. For the local
oscillator, a laser with a higher output power 
is better, to give a larger beat signal for hetero- 
dyne detection. 
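
The linewidth requirement scales linearly with bit-rate, as the short sketch below shows. The function name is hypothetical, and the factor of 10⁻³ is inferred from the text's own example (1 Gbit/s requiring less than 1 MHz).

```python
def max_linewidth_hz(bit_rate_bit_s, linewidth_to_bitrate=1e-3):
    """Upper bound on laser linewidth for CPFSK under the quoted rule of thumb."""
    return linewidth_to_bitrate * bit_rate_bit_s


for rate in (1e9, 2e9, 10e9):
    limit_mhz = max_linewidth_hz(rate) / 1e6
    print(f"{rate / 1e9:.0f} Gbit/s: linewidth below about {limit_mhz:.0f} MHz")
```

The bound relaxes at higher bit-rates, so the 2 Gbit/s channels discussed in the introduction would tolerate about twice the linewidth of a 1 Gbit/s channel.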

It has been very difficult to satisfy all these 
requirements, especially the rather contradictory 
requirements of both narrow linewidth and 
wavelength tunability. For the first time, we
created such a laser by using a new wavelength
tuning mechanism: a three-electrode, long
cavity λ/4 shifted DFB laser⁵⁾,⁶⁾.

For coherent detection, the receiver must 
have new capabilities; a dual-balanced optical 
receiver, using two photodiodes in a balanced 
mixer configuration, is an absolute necessity. 
It cancels excess intensity noise generated by 
the local oscillator laser, and it uses the availa- 
ble signal power efficiently. We integrated two 
finely-balanced PIN photodiodes (twin-PIN 
photodiode) to make a balanced detector, and 
developed a quad-PIN photodiode by combining 
two pairs of twin-PIN photodiodes. This quad- 
PIN photodiode enables polarization diversity 
coherent detection. The balanced PIN-photo- 
diodes were designed in a flip-chip configuration 
to reduce parasitic capacitance. 
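
The noise cancellation in a balanced mixer can be illustrated numerically. The sketch below is an idealized model, not the authors' receiver: it assumes a lossless 50/50 coupler, two identical photodiodes, and works directly in optical power (the photocurrent is proportional to it). All names and power levels are hypothetical.

```python
import math


def coupler_outputs(p_signal_w, p_lo_w, phase_rad):
    """Optical powers at the two outputs of an ideal 50/50 coupler."""
    common = 0.5 * (p_signal_w + p_lo_w)
    beat = math.sqrt(p_signal_w * p_lo_w) * math.cos(phase_rad)
    return common + beat, common - beat


def balanced_output(p_signal_w, p_lo_w, phase_rad):
    """Difference of the two detected powers: the common term cancels."""
    p1, p2 = coupler_outputs(p_signal_w, p_lo_w, phase_rad)
    return p1 - p2


P_SIGNAL = 1e-6        # W, illustrative received signal power
P_LO = 1e-3            # W, illustrative local oscillator power
LO_FLUCTUATION = 0.01  # 1 percent excess intensity fluctuation of the LO

# At quadrature the beat term vanishes, which isolates the effect of LO noise.
quadrature = math.pi / 2
single_change = (coupler_outputs(P_SIGNAL, P_LO * (1 + LO_FLUCTUATION), quadrature)[0]
                 - coupler_outputs(P_SIGNAL, P_LO, quadrature)[0])
balanced_change = (balanced_output(P_SIGNAL, P_LO * (1 + LO_FLUCTUATION), quadrature)
                   - balanced_output(P_SIGNAL, P_LO, quadrature))
peak_beat = balanced_output(P_SIGNAL, P_LO, 0.0)

print(f"single detector sees an LO noise term of {single_change:.2e} W")
print(f"balanced pair sees {balanced_change:.2e} W")
print(f"peak beat signal from the balanced pair: {peak_beat:.2e} W")
```

In this model the directly detected LO power, and with it the LO intensity fluctuation, appears equally on both diodes and cancels in the difference, while the beat terms add, so both coupler outputs contribute to the signal.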

To make a more advanced and convenient 
receiver, it will be necessary to integrate the 
optical coupler, balanced detector, local oscilla- 
tor laser, and electrical preamplifier!®. We have 
taken the first step toward trying this integra- 
tion. 


3.2 Three-electrode, long cavity λ/4 shifted DFB
laser

Figure 6 shows a schematic diagram of the 
laser. Cavity length is 900 um and an MQW 
active layer is introduced. The long cavity gives 
a large Q-factor and the MQW active layer gives 
a smaller linewidth broadening factor. These 
result in a narrow spectral linewidth. The 
quarter wavelength shift in the grating and 
multiple electrodes provide a means to tune the 
wavelength. We can tune the wavelength by 
changing the current to each electrode. Typically,
the side electrodes are connected and the
center current Jc is changed to tune the wavelength.
This is a new tuning mechanism⁵⁾,⁶⁾. The
quarter wavelength shift causes intense spatial
hole-burning; the carrier concentration is depressed
at the center of the cavity. This carrier
depression is controlled by selectively injecting
carriers using the multiple electrodes; this
changes the refractive index and hence shifts
the wavelength.

Fig. 8—FM response.

Figure 7 shows an example of tuning characteristics.
The wavelength becomes longer as the
center current Jc is increased. The linewidth
was around 1 MHz for the entire tuning range
of 1.13 nm, and it decreased as Jc increased.
Output power was high at about 20 mW.

A particular feature of this tuning mecha- 
nism is that the wavelength shift is a red shift: 
the wavelength becomes longer as the center 
current Jc is increased. In the conventional DFB 
laser, the tuning mechanism is predominantly 
a blue shift, which is opposite to the thermal 
effect, and its FM response dips around 1 MHz 
due to the compensating effects of the thermal 
red shift and the carrier effect blue shift. The 
wavelength tuning of our laser has a red shift, 
and this gives a flat FM response as shown in 
Fig. 8. This gives good eye opening. 


3.3 Flip-chip twin-PIN and quad-PIN photodiode

Figure 9 illustrates a monolithic twin-PIN 
photodiode with flip-chip configuration. PIN 
photodiodes are back-illuminated with a mono- 
lithic lens to couple the optical fiber. Metal 
bumps surround the photodiodes for flip-chip 
bonding. The photodiode consists of an n-InP 
top layer, an n-InGaAs absorption layer, and an
n-InP contact layer. The photodiode junction
has a diameter of 20 µm and was formed by
selective Zn diffusion; a mesa structure was used
for electrical isolation. The two microlenses
were fabricated by Ar ion beam etching. The
3 dB bandwidth of the twin-PIN photodiode was
13 GHz, helped by the low capacitance
of 80 fF owing to the flip-chip bonding.
The quantum efficiency was 80 percent and the
common mode rejection ratio (CMRR) was
-30 dB up to 7 GHz.

Figure 10 shows how two twin-PIN photo- 
diodes were combined to produce a quad-PIN 
photodiode for polarization diversity coherent 
detection. The two twin-PIN photodiodes were 
integrated monolithically. Figure 11 is a flip- 
chip bonded quad-PIN photodiode. Four mono- 
lithic lenses are visible, and these monolithic 
lenses give a uniform and large photosensitive 
area of 70 µm for each diode. Figure 12 shows
the frequency response and the CMRR for the
pair of diodes. There is a large bandwidth and a
common-mode rejection ratio of -30 dB up to
10 GHz. The crosstalk between the two twin-
PIN photodiodes was below -40 dB up to
14 GHz.

Fig. 12—Frequency response and common-mode
suppression ratio for diode pairs.


3.4 Integrated coherent receiver 

Figure 13 shows an attempt to make an
integrated coherent receiver®). There is a wave- 
guide coupler on one side of the chip and 
balanced PIN photodiodes with interdigitated 
contacts on the other. The waveguide coupler 
and the detectors are optically coupled by a 
monolithic mirror. The waveguide coupler is a 
diluted MQW layer, which gave a low propagation
loss of 0.5 dB/cm and a large numerical
aperture (NA). The coupling loss to a single
mode fiber was 0.9 dB. Recent improvements
in the PIN photodiode have raised quantum
efficiency to 85 percent, with a deviation of one
percent between the two diodes for an applied
voltage of 5 V. This receiver operated as an
integrated device, giving a beat signal output.

Fig. 13—A monolithic receiver.


3.5 Discussion 

A transmission experiment using the three- 
electrode λ/4 shift DFB laser and the twin-PIN
photodiodes was performed for CPFSK modula- 
tion'”. At 4 Gbit/s, sensitivity was high at 
100 photons/bit, and transmission over 233 km 
was achieved. An experiment on polarization 
diversity transmission was also conducted using 
two twin-PIN photodiodes at 2.5 Gbit/s!”, and
gave a span loss margin of 53.2 dB and transmis- 
sion over 254 km. The required level of perform- 
ance to conduct the transmission experiments 
was achieved for both the laser and the detector. 
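
As a rough cross-check of what a sensitivity of 100 photons/bit means in power terms, the sketch below converts photons per bit into received optical power. The 1.55 µm wavelength is an assumption (the operating wavelength is not stated in this section), and the function name `sensitivity_dbm` is hypothetical.

```python
import math

PLANCK = 6.62607015e-34      # Planck constant, J*s
LIGHT_SPEED = 2.99792458e8   # speed of light in vacuum, m/s


def sensitivity_dbm(photons_per_bit, bit_rate, wavelength_m=1.55e-6):
    """Average received power in dBm for a given photons/bit sensitivity.

    wavelength_m defaults to 1.55 um, which is an assumed value.
    """
    photon_energy = PLANCK * LIGHT_SPEED / wavelength_m   # joules per photon
    power_w = photons_per_bit * photon_energy * bit_rate  # joules per second
    return 10 * math.log10(power_w / 1e-3)                # relative to 1 mW


# 100 photons/bit at 4 Gbit/s, as quoted for the CPFSK experiment
print(round(sensitivity_dbm(100, 4e9), 1))
```

Under the assumed wavelength, 100 photons/bit at 4 Gbit/s corresponds to a few tens of nanowatts of received power.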

However, there are some problems to be 
overcome to make further progress. For one, 
the tuning range of the laser is rather narrow. 
For optical frequency division multiplexing of
100 channels, a local laser with a tuning range
of 8-10 nm is required. A tuning range of 7 nm
was recently reported for a similar laser, which
used a strained MQW active layer!®). This is
encouraging for increasing the tuning range. The
quad-PIN photodiode still has to be assembled
with a polarizer, coupler, and electrical preamplifier
before it can be implemented in systems.
A monolithic device in which all of these com- 
ponents are integrated would be most useful. 
Trials for such integration have already started, 
but much work remains to be done before a 
practical integrated device can be built. 


4. Conclusion 

Laser light sources and receivers for the next 
generation of optical communication systems, 
specifically for very high bit-rate IM/DD systems
and coherent communication systems, have been
reviewed. Using an MI-DFB laser, wavelength
chirp has been reduced remarkably, and a GB
product of 70 GHz with a 9 GHz bandwidth
has been attained in an APD. A multielectrode
DFB laser with a linewidth below 1 MHz, a
1.13 nm tuning range, and good FM response
has also been achieved. The balanced
detectors, the twin-PIN and quad-PIN photodiodes,
gave a good common mode rejection ratio.
Trials on an integrated coherent receiver were 
also conducted. The achievements made it possi- 
ble to perform IM/DD transmission at 10 Gbit/s 
over 65 km!®, and coherent transmission at 
4 Gbit/s over 233 km'”). 

These achievements are the first step toward 
the next generation of systems. There remains 
much work to be done to make these devices 
perform satisfactorily. The key to further prog- 
ress is better growth and processing technology. 
For example, the wavelength variance of DFB 
lasers with conventional technology is around 
±5 nm. This is quite insufficient for wavelength
division or optical frequency division multiplex- 
ing for large capacities. Growth and processing 
technology must be developed to give precise 
control over layer thickness, composition and 
stripe width to reduce wavelength variance. This is 
also true for other integrated devices which 
require precise control of dimensions. To
develop the next generation of devices, efforts
should concentrate on improving growth and
processing technology, and exploiting the
structure of these devices.


Computer algebra is a newly developing research area that bridges constructive mathematics
and scientific information processing technology. Research in this area includes design,
analysis, and prototyping of mathematical algorithms for symbolic and algebraic computation.


This paper describes an experimental system for computer algebra, named risa — Research 
Instrument for Symbolic Algebra. At present, the system forms the basic kernel of a 
computer algebra system, and is expected to become an essential engine for advanced future 
systems. As a general-purpose computer algebra system, risa’s operating speed is competitive 
and sometimes outstanding. This paper also briefly describes several mathematical results of 
some algebraic computations related mainly to polynomials. 


1. Introduction 

Computer algebra systems are powerful tools 
of science and technology. By using a computer 
algebra system, scientists and engineers can fully 
exploit the available computer power to carry 
out the long and tedious processes involved in 
computing large expressions. 

At the beginning of the computer algebra 
era, the 1950’s and 1960’s, the key technologies 
of computer algebra systems were symbolic 
manipulation and artificial intelligence. In these 
decades, it was realized that a purely symbolic 
(i.e. non-algebraic) approach to many simple 
problems tended to explode in terms of time 
and storage. As far as computer algebra is 
concerned, advanced artificial intelligence 
technology based on the simulation of human 
heuristic processes can solve only small sized 
problems that are solvable even by human 
calculation. 

As a consequence, the simple strategy of 
using symbolic manipulation and artificial 
intelligence had to be reconsidered. Modern 
computer algebra systems deeply depend on 
efficient algebraic algorithms (e.g. factorization 
of polynomials) that were developed through 
investigating algebraic structures of mathematical
objects. This relationship is similar to
the way modern numerical computing tech- 
nology depends on the various numerical 
algorithms designed through numerical analysis. 
The research on these algebraic algorithms is 
called computer algebra. Although symbolic 
technology is still a useful way to improve 
system performance, its importance has de- 
creased. The application of artificial intelligence 
(AI) to practical problems is a long way off. 
AI may help improve the user interface, for 
example, by helping the user to choose the best 
algorithm from several alternatives, but there is 
no known definite choice criterion. 

In the following chapters, we describe our 
experimental system for computer algebra, risa. 
Also, we show several mathematical results for 
efficient computations that are mainly related to 
polynomials. 


2. Risa — an experimental computer algebra system

Risa (Research Instrument for Symbolic 
Algebra) is a computer algebra system that, 
although currently limited, has great potential. 
It was developed and will continue to be
developed as a practical tool for professional
computation in mathematical sciences and
engineering. At present, risa forms the basic 
kernel of a computer algebra system that realizes 
arbitrary precision integer arithmetic and 
polynomial arithmetic together with a simple 
programming language called asir. By taking 
advantage of its independent open module 
structure, risa will be improved step by step in 
terms of its variety of service functions and 
performance. Risa mostly uses the theoretically 
and/or empirically best algorithms (algebraic 
and symbolic) that are available. 

Computing data and timing data are shown 
in the following sections. All timing data shown 
in this paper was measured on an S4/370 (Sparc
Station 370) with 32 Mbytes of real memory 
and 64 Mbytes of virtual memory. 


2.1 Design principle of the risa scientific computation system

Risa was designed to be an indispensable 
component of future scientific computation 
systems. The technology that is now being 
evaluated is the independent open module 
structure. Risa is a collection of independent
open modules. Basically, risa is a collection of
UNIX libraries (see Note). Each UNIX library (i.e. each
module) is a set of mutually related functions
that performs a significant task, for example,
basic algebraic computation, complex algebraic
computation, basic numerical computation,
graph plotting, or programming language
interpretation and evaluation. Editors, two-
dimensional formatters, numerical software,
and high-quality graphic software are not of 
prime interest in our current computer algebra 
research. Therefore, they are not included in 
the current risa system. Regarding the program- 
ming environments, for example, editors, we 
recommend that the user should use the most 
convenient and familiar environment. Regarding 
numerical and graphical facilities, risa will be 
connected to libraries that will be widely 
available in the future. Libraries that are
connected to the risa system must be open to
external software just as risa itself is open to
external software.

Note: The UNIX operating system was developed and is licensed
by UNIX System Laboratories, Inc.

The kind of service a program module 
should provide and the number of functions 
in a program module depend on the following: 
1) Software efficiency
2) Compactness and completeness of resultant software
3) Ease of software development by research groups with limited programming resources
4) Ease of combining and connecting modules developed at independent sites
5) Portability from small computers to supercomputers.

We believe that the independent open 
module structure concept proposed here is a 
good way to integrate large-scale software by 
collecting and combining software developed 
by independent contributors. 

Risa itself does not have a programming 
environment, or graphic or numerical software; 
therefore, these services must be provided by 
other sources. Risa can serve as an algebraic 
computation engine if requested by other soft- 
ware. The use of existing, well designed soft- 
ware seems to be a more reasonable solution 
than the development of a special global soft- 
ware package. 

The designers of most computer algebra
systems have been eager to incorporate
additional features, for example, numerical and
graphical features, editors, two-dimensional
formatters, and various kinds of input and
display facilities. Because of this strategy,
computer algebra systems have become
enormous and prone to inflexibility when handling
access requests from outside the system.

Macsyma is a computer algebra system that 
has a large set of symbolic, algebraic, and 
numerical operations. Mathematica has the 
‘note book’, which is a kind of editor front end 
special to the Macintosh and NeXT computers. 
The note book can display high-quality three- 
dimensional graphics with many numerical 
functions. 

A typical problem with these systems is that 
the entire computer algebra system must be used 
even when only a part of it is required. Also, the
user may need to use the best organized existing
simulation system with accuracy guaranteed, or
the user may prefer another editor to the one provided
by the system.

Fig. 1—Current structure of risa system.


2.2 Structure of the risa system 

The current risa system consists of three parts (see Fig. 1):

1) Risa basic algebraic engine,
2) Risal and asir, small programming language interpreter-evaluators,
3) Libraries written in C or asir.

Basic functions whose operating speeds have 
a direct and significant effect on the system 
performance are written in C language. These C
program codes (basic and general-purpose)
are aggregated into an engine. Functions that are
not basic but nevertheless sensitive to operating 
speed are also written in C language. These C 
program codes constitute a special purpose 
library in C language. Functions that are neither 
basic nor sensitive to operating speed are written 
in other programming languages, especially risal 
and asir. 

Our independent open module structure
strategy enables the use of several different
engines that serve different objectives.
Moreover, risa can be connected to
more than one copy of the same engine. For
such a multi-engine environment, an extension
of risa that supports control protocols is now
being planned.

Fig. 2—Future risa system.


The risa basic algebraic engine performs 
mainly polynomial arithmetic with rational 
number coefficients, and numerical and graphic 
engines will be incorporated in the future. The 
so-called parallel Risch algorithm for indefinite 
integration of a certain class of elementary func- 
tions is implemented as one example of a C- 
coded library. A library for factorization over 
algebraic extension fields is written in asir. 

The future structure of risa will be open to 
many other program modules (see Fig. 2). 


2.3 Running environments 

Risa is being developed on the UNIX operat- 
ing system, and in principle also runs on any 
UNIX system. The common types of machines 
on which risa can run include Sun3, Sun4 (Sparc), 
VAX with the Ultrix operating system, News, 
RISC News, NeXT, DecStation, and Apollo 
Domain. 

As mentioned in Sec. 2.1, risa has neither 
an editing facility nor a two-dimensional print- 
ing facility. However, any popular editor (e.g. 
emacs or vi) can be used to code programs in 
asir, and any front-end input editor (e.g. fep or 
emacs) can be used to input asir commands. The
TeX (see Note) style output of expressions on a screen
or on paper becomes available if a TeX style
output module is connected to the risa system.

The object size of the current risa system is 
about 300 Kbytes, including the risa basic 
algebraic engine and asir. The working memory 
can be expanded as required to meet the com- 
putational needs. 

Risa’s memory management is done using 
UNIX system calls for memory allocation. Risa 
does the following when it requires storage bytes 
for data: 

1) Searches the free-list in the heap space for 
available memory. 

2) If sufficient free memory is not found, 
invokes the garbage collection subroutine. 

3) If sufficient memory bytes for the request 
cannot be reclaimed, issues a system call to 
increase the heap area. 
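
The three-step policy above can be sketched as a toy model. This is an illustration only, not risa's code: the class `ToyHeap`, its method names, and the explicit `drop` call (standing in for an object becoming unreachable) are all hypothetical, and the free-list search is simplified to counting free bytes, so fragmentation is ignored.

```python
class ToyHeap:
    """Toy model of: search free space, then collect garbage, then grow the heap."""

    def __init__(self, size):
        self.size = size          # current heap size in bytes
        self.blocks = {}          # block id -> size in bytes (live or garbage)
        self.reachable = set()    # ids of blocks still in use
        self.next_id = 0

    def free_bytes(self):
        return self.size - sum(self.blocks.values())

    def collect_garbage(self):
        # Reclaim every block that is no longer reachable.
        for block_id in list(self.blocks):
            if block_id not in self.reachable:
                del self.blocks[block_id]

    def allocate(self, nbytes):
        how = "free-list"                     # step 1: enough free space already
        if self.free_bytes() < nbytes:
            self.collect_garbage()            # step 2: try to reclaim memory
            how = "gc"
        if self.free_bytes() < nbytes:
            self.size += nbytes - self.free_bytes()   # step 3: enlarge the heap
            how = "grow"
        block_id = self.next_id
        self.next_id += 1
        self.blocks[block_id] = nbytes
        self.reachable.add(block_id)
        return block_id, how

    def drop(self, block_id):
        """Mark a block as unreachable; it is reclaimed at the next collection."""
        self.reachable.discard(block_id)


heap = ToyHeap(100)
first, how = heap.allocate(60)
heap.drop(first)
print(how, heap.allocate(60)[1], heap.allocate(60)[1])
```

In the usage lines, the second request is satisfied only after a collection, and the third forces the heap to grow because the second block is still in use.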

Risa’s garbage collection method follows the 
‘Garbage Collection in an Uncooperative
Environment’ by Boehm and Weiser”). This 
method is very simple and is easily incorporated 
into software without ‘significant co-operation’ 
from the caller. Such independence of program 
codes perfectly meets risa’s requirements. 


2.4 Portability 

Most of risa is written in C language, but a 
very small fragment of code for long integer 
multiplication and division is written in 
assembler. This fragment is so small that its
effect on the system portability is negligible.
Another system dependent part is ‘garbage 
collection codes’. This part contains several 
assembler codes for different hardware and, of 
course, depends upon standard system calls of 
UNIX C for memory allocation. This part also 
depends upon the way in which the compiler 
manages memory for data objects. Machine 
dependent codes are provided for those 
machines mentioned in the first paragraph of 
Sec. 2.3. For small computers with limited 
address spaces and for mainframes without 
linear address spaces, machine dependent codes 
for garbage collection are not currently available.
However, it is not so difficult to implement
garbage collectors for the respective machines if
their operating systems provide an environment
for memory management that is similar to
UNIX.

Note: TeX is a computer typesetting system created by Knuth).

The system (hardware, operating system,
and compiler) dependent property of garbage
collection is the only limitation on risa’s
portability.


2.5 Implementation language: C vs. Lisp 
General-purpose computer algebra systems
used to be implemented in Lisp (e.g. Macsyma
and Reduce). But recent commercially prevailing
computer algebra systems (e.g. Maple and
Mathematica) are written in C language or one
of its relatives.

The reason for the current trend may be as 
follows: 

1) Computer algebra systems include symbol 
manipulation. Initially, Lisp was the only 
solution for symbol manipulations that 
included problems such as handling variable 
sized structured data for symbolic ex- 
pressions and dynamic memory allocation 
with garbage collection. 

2) Because of the rapid progress in computer
algebra technology, the main problems with
computer algebra systems have shifted from
symbol manipulation to algebraic computation.

3) Also, research and development has established
a methodology for garbage collection.

Therefore, Lisp is no longer the only solution
for computer algebra systems. Moreover,
the following points have to be taken into
account:
1) System size 

A Lisp system is usually very large; therefore, 
a computer algebra system constructed in Lisp 
is also large unless a small sized Lisp system 
specially designed for the computer algebra 
system is used. Conversely, a C implementation 
of a computer algebra system generates a relative- 
ly small (sometimes very small) computer 
algebra system. 
2) Linking other systems 

Because a Lisp system usually has a special
storage structure and usage, Lisp-based systems
are usually subject to restrictions when linked 
to other software. Although most Lisp systems 
can be linked to other language based systems, 
the link is usually biased in one direction (i.e. 
calling from Lisp is usually well organized 
but calling to Lisp is often awkward and some- 
times impossible). On the other hand, there are 
very few restrictions on accesses between C 
based systems and other language based systems. 
3) Portability 

C language is the standard system/applica- 
tion programming language in the UNIX envi- 
ronment. All UNIX systems have a C compiler. 
Recently, even small computers and mainframes 
have started to support C language. Compared to 
C language, Lisp has split into many dialects and 
cannot be a standard implementation language. 
4) Accessibility to internal data structures 

Some advanced users often want to know 
about internal data structures. In principle, this 
presents a problem to designers in terms of the 
kind of access methods that are available to 
users, and how convenient and efficient they 
are. This is not a problem of which language 
the system is written in. Reduce (in Lisp) opens 
its internal structures in detail, but Macsyma 
(also in Lisp) does not. Risa (in C language) 
opens its internal structure using primitives to 
access basic data structures, but Mathematica
(also in C) does not.

Risa is mainly implemented in C language to 
enable independent cooperation with other 
systems. 


3. The risa basic algebraic engine 

In the following discussion, data objects of
risa are described using tuple representation,
for example, $(\alpha, \beta, \gamma)$, for structures of fixed
size in C language. For variable sized data, the
list representation, denoted by $\{\alpha, \beta, \ldots, \gamma\}$, is
used. In implementation, the latter is not
necessarily stored in a usual list structure with
pointer fields, but may be stored in a variable
sized array.

We will use the following convention. Let
$\alpha$ be a data object to be stored in risa, and let
$\bar{\alpha}$ be the internal representation of data object
$\alpha$ in risa.

Although these conventions give only an 
informal concept of data objects, this approach 
is sufficient to illustrate the data representation 
of the system. 


3.1 Basic data types 

3.1.1 Integers and rational numbers 

In risa, the most basic mathematical data 
objects are rational numbers. Integers are a 
special kind of rational number. Integer 0 (zero) 
has a special form, namely, four null bytes. 

Let a and b be relatively prime positive
integers. Also, let $\sigma$ be $+1$ or $-1$. A rational
number $\sigma a/b$ is stored in the risa system as a
structure of four members: (‘rational’, $\sigma$, $\bar{a}$, $\bar{b}$).

Here, $\bar{a}$ and $\bar{b}$ denote the representation of
positive integers a and b by arrays in C language.

As in ordinary arithmetic, such potentially
large positive integers are represented using
a fixed positional number system. In risa, the
radix B is $2^{27}$. In an array, the B-ary digits of
an integer are arranged so that the less significant
digits occupy the smaller index positions.
That is, a positive integer represented by
$a = a_0 + a_1 B + \cdots + a_n B^n$ is stored in the array
$\bar{a} = \{a_0, a_1, \ldots, a_n\}$.

Integer $\sigma a$ is stored in the form (‘rational’,
$\sigma$, $\bar{a}$, 0).

Do not worry about the inconsistency with 
the usual meaning of rational numbers as this is 
only a programming convention. (To be con- 
sistent with the rational number representation, 
the last member of the above representation 
should be 1.) 
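
The number representation described above can be mimicked in a few lines. The sketch assumes the radix B = 2^27 and uses Python lists and tuples in place of C arrays and structures; the function names `to_digits`, `from_digits`, and `encode` are hypothetical, and zero is modelled simply as the integer 0 rather than four null bytes.

```python
from fractions import Fraction

B = 2 ** 27  # assumed radix of the positional number system


def to_digits(a):
    """B-ary digits of a positive integer, least significant digit first."""
    digits = []
    while a > 0:
        a, d = divmod(a, B)
        digits.append(d)
    return digits


def from_digits(digits):
    """Inverse of to_digits: rebuild the integer from its B-ary digits."""
    value = 0
    for d in reversed(digits):
        value = value * B + d
    return value


def encode(q):
    """Model of ('rational', sign, numerator digits, denominator digits).

    Integers carry 0 in the last member, following the convention in the text.
    """
    q = Fraction(q)
    if q == 0:
        return 0
    sign = 1 if q > 0 else -1
    num = to_digits(abs(q.numerator))
    den = 0 if q.denominator == 1 else to_digits(q.denominator)
    return ("rational", sign, num, den)


print(to_digits(B + 5))          # digits of B + 5, lowest digit first
print(encode(Fraction(-3, 4)))   # a proper fraction
print(encode(7))                 # an integer, with 0 as the last member
```

Storing the least significant digit at index 0 lets carries propagate toward increasing indices, which is convenient for addition and multiplication loops.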

3.1.2 Polynomials 

Polynomials play a central role in risa. 

There are many ways to represent poly- 
nomials in a computer. Each way has its own 
advantages and disadvantages. The risa basic 
algebraic engine uses so called canonical recur- 
sive representation. In this representation, 
a polynomial in several indeterminates is first
considered as a uni-variate polynomial with
respect to an indeterminate with coefficients
(possibly polynomials) in the remaining indeterminates.
Thus, the polynomial

$$P = a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0$$

in x with degree n and $a_n \neq 0$ is stored in risa
using the following data structure:

(‘polynomial’, ‘x’, ECPS),

where ECPS (exponent-coefficient pairs) is a list

$$\{(n, a_n), (n-1, a_{n-1}), \ldots, (1, a_1), (0, a_0)\}.$$

The indeterminate x is referred to as the
main variable of polynomial P. The coefficient
$a_n$ is referred to as the leading coefficient of
polynomial P, and is denoted by lc(P).

In risa, if a coefficient, say $a_i$, is zero valued,
the corresponding exponent-coefficient pair,
$(i, a_i)$, is not included in the above ECPS representation,
thus saving storage space. This
type of representation of polynomials is called 
sparse representation. For reasons that are 
complicated, the majority of computer algebra 
systems, risa included, use sparse representation. 

A multi-variate polynomial is stored recursively
so that the coefficients $a_i$ are again
polynomials. For a multi-variate polynomial
P represented in such a recursive structure with
respect to the ordering of indeterminates, we
define the leading number coefficient, denoted
by lnc(P), using the following recursive formula:


lnc(P) = if lc(P) is a number then lc(P) else
lnc(lc(P)).


The representation of a polynomial in risa is
unique provided that the ordering for in- 
determinates appearing in the polynomial is 
given. This type of data representation is called 
canonical representation. 
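
A small model of the sparse recursive representation and of lc and lnc may help. It is a sketch using Python tuples in place of risa's C structures; the constructor name `poly` is hypothetical, and coefficients are plain Python integers.

```python
def poly(var, ecps):
    """Build ('polynomial', var, ECPS) with zero terms dropped and
    exponents in descending order (sparse representation)."""
    pairs = [(e, c) for e, c in ecps if c != 0]
    pairs.sort(key=lambda pair: pair[0], reverse=True)
    return ("polynomial", var, pairs)


def lc(p):
    """Leading coefficient: the coefficient of the highest exponent."""
    return p[2][0][1]


def lnc(p):
    """Leading number coefficient, following the recursive formula in the text."""
    c = lc(p)
    return lnc(c) if isinstance(c, tuple) else c


# P = (3y^2 + 1) x^2 + 5, with the ordering x before y.
inner = poly("y", [(2, 3), (0, 1)])
P = poly("x", [(2, inner), (1, 0), (0, 5)])

print(P[2])      # only the x^2 and x^0 terms are stored
print(lc(P))     # the coefficient polynomial in y
print(lnc(P))    # the leading number coefficient
```

Because the zero coefficient of x is never stored, the ECPS list for P has two entries rather than three.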

3.1.3 Rational expressions 

Let A and B be polynomials that are not 
necessarily relatively prime. A rational expres- 
sion A/B is stored in the following form: 


(‘rational expression’, A, B). 


In this representation, we assume that the
leading number coefficient of B, lnc(B), is
positive.

Unlike the representation of polynomials 
and rational numbers, the representation of 
rational expressions in risa is not unique (i.e. not 
canonical). This is because we do not assume 
relative primeness between numerator poly- 
nomial A and denominator polynomial B. No 
automatic reduction is done in the risa basic 
algebraic engine. Reduction of common factors 
is determined by the user. 

3.1.4 Functors and kernels 

In risa, a monomial function form, such as 
f(x +y) (i.e. a functor with one or more argu- 
ments) is called a kernel. The risa basic algebraic 
engine cannot distinguish kernels from ordinary 
indeterminates (e.g. x, y). Kernels are treated in 
polynomials and in rational expressions as 
ordinary indeterminates. That is, the risa basic 
algebraic engine cannot recognize kernels as 
objects with structures. 

Therefore, the risa basic algebraic engine 
does not directly support the data structure of 
kernels or operations on kernels. Instead, such 
data structures and operations can be supported 
by programming languages. This task division 
frees the algebraic engine from complex data 
handling tasks, and allows it to concentrate on 
manipulating polynomials. At present, monomi- 
al function forms are fully supported by risal 
and partly by asir. Functors and kernels are 
further described in Subsec. 4.3.1. 


3.2 Basic algebraic operations 

3.2.1 Arithmetic of integers and rational numbers

Arithmetic operations on integers and 
rational numbers (fractions) are the basis of the 
algebraic system. 

For integers represented in the above struc- 
ture, addition, subtraction, multiplication, and 
division are rather straightforward because they 
can be done on the B-ary number system in a 
similar way as decimal arithmetic is done by 
hand. In particular, division of two long B-ary
integers is carried out following the method
shown by Knuth?) which estimates the
next digit of the quotient with at most one
failure in the division (see Note).

Table 1. Time to compute n! for several values of n

n            10    100    1000    10000
Time (ms)     0     10     310    47000


Table 2. Time to compute exact division of two long
integers, $(10^n)!$ and $(10^{n-1})!$

Expression   100!/10!   1000!/100!   10000!/1000!
Time (ms)           0           30           4680

Times for $(10^n)!$ and $(10^{n-1})!$ are not included.

Table 1 shows the computation time for
$n! = 1 \times 2 \times \cdots \times n$ using a built-in function for
several values of integer n. For comparison, see
Tables A-1 and A-2 in Appendix A.1 which 
show the computation times for the same 
function using user-defined functions. The com- 
putation times for division of long integers are 
shown in Table 2. 

Arithmetic operations on rational numbers 
(fractional arithmetic) are also straightforward, 
but they must be carefully carried out so that 
the integers in the calculations are kept as small
as possible. For this purpose, greatest common 
divisor (GCD) operations between two integers 
must be done. The greatest common divisors are 
used to remove, in advance, the common integer 
factors of the final numerator and denominator. 
This technique also follows the method shown 
by Knuth®. 
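
The idea of removing common factors in advance can be sketched as follows. This follows the commonly described fraction algorithms in Knuth's text as far as we understand them and is not taken from risa's source; the function names are hypothetical, and inputs are assumed to be in lowest terms with positive denominators.

```python
from math import gcd


def add_frac(u, ud, v, vd):
    """u/ud + v/vd, using GCDs of small operands instead of one large GCD."""
    d1 = gcd(ud, vd)
    if d1 == 1:
        # Denominators are coprime: the result is already in lowest terms.
        return u * vd + v * ud, ud * vd
    t = u * (vd // d1) + v * (ud // d1)
    if t == 0:
        return 0, 1
    d2 = gcd(t, d1)          # the only factor that can still be cancelled
    return t // d2, (ud // d1) * (vd // d2)


def mul_frac(u, ud, v, vd):
    """(u/ud) * (v/vd), cancelling cross factors before multiplying."""
    if u == 0 or v == 0:
        return 0, 1
    d1 = gcd(u, vd)
    d2 = gcd(v, ud)
    return (u // d1) * (v // d2), (ud // d2) * (vd // d1)


print(add_frac(1, 6, 1, 10))   # 1/6 + 1/10
print(mul_frac(3, 4, 2, 9))    # 3/4 * 2/9
```

The point of the design is that every GCD is taken between numbers no larger than the inputs, so the large product of the denominators never has to be reduced afterwards.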

3.2.2 Arithmetic operations on polynomials and rational expressions

Arithmetic operations on polynomials are of
central importance in the risa basic algebraic 
engine. In some ways, arithmetic operations on 
univariate polynomials are similar to and some- 
times simpler than operations on integers. This is 
because there is no need to perform carry-over 
or borrow-from operations among monomials of 
different exponents when adding or subtracting 
two polynomials. The calculation of powers of a
polynomial can be speeded up very effectively
by using binomial coefficients and binomial
expansion:

$$(\alpha+\beta)^n = \alpha^n + \binom{n}{1}\alpha^{n-1}\beta + \cdots + \binom{n}{i}\alpha^{n-i}\beta^i + \cdots + \binom{n}{n-1}\alpha\beta^{n-1} + \beta^n.$$

Note: The possibility of a failure in the division is extremely small.

For powers of a multi-nomial, binomial
expansion is recursively applied by partitioning
the multi-nomial into nested binomials, e.g.:

$$(\alpha+\beta+\cdots+\psi+\omega)^n = ((\alpha+\beta+\cdots+\psi)+\omega)^n.$$
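
The recursive use of binomial expansion can be illustrated on sparse uni-variate polynomials. The sketch below is an assumption about one way to organize the recursion (leading monomial versus the rest), not risa's actual routine; polynomials are dictionaries mapping exponents to coefficients, and `poly_pow` and `poly_mul` are hypothetical names.

```python
from math import comb


def poly_mul(p, q):
    """Plain term-by-term product, used here only for comparison."""
    result = {}
    for e1, c1 in p.items():
        for e2, c2 in q.items():
            result[e1 + e2] = result.get(e1 + e2, 0) + c1 * c2
    return {e: c for e, c in result.items() if c != 0}


def poly_pow(p, n):
    """p**n by binomial expansion: split p into its leading monomial
    alpha and the remaining terms beta, then expand (alpha + beta)**n."""
    if n == 0:
        return {0: 1}
    terms = sorted(p.items(), reverse=True)
    if not terms:
        return {}
    (e, c), rest = terms[0], dict(terms[1:])
    if not rest:
        return {e * n: c ** n}            # power of a single monomial
    result = {}
    for i in range(n + 1):
        scale = comb(n, i) * c ** (n - i)  # binomial coefficient times alpha part
        shift = e * (n - i)
        for ex, co in poly_pow(rest, i).items():   # beta**i, found recursively
            result[ex + shift] = result.get(ex + shift, 0) + scale * co
    return {ex: co for ex, co in result.items() if co != 0}


print(poly_pow({1: 1, 0: 1}, 3))   # (x + 1)^3
```

Powers of a monomial cost a single exponentiation, which is where the saving over repeated multiplication comes from. A production version would cache the powers of the remaining terms instead of recomputing them for each i.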


For multi-variate polynomials, addition, sub- 
traction, and multiplication can be carried out 
using the same procedure applied recursively 
with respect to their coefficient polynomials.
Division, however, is more difficult. 

Let A and B be uni-variate polynomials in 
the same indeterminate, (e.g. x) with rational 
number coefficients. Dividing A by B produces 
the quotient Q and a remainder R such that 


A=QB+R, 


where Q and R are both uni-variate polynomials
in x with rational number coefficients,
and $\mathrm{degree}_x(R) < \mathrm{degree}_x(B)$. If A and/or B
contain one or more extra indeterminates,
recursive application of division to their coeffi- 
cient polynomials leads to difficulty. Division 
usually produces a quotient and a remainder, 
but in indeterminates other than x we want to 
get a single polynomial for a coefficient of the 
quotient polynomial in x. One possible solution 
is to divide coefficients in rational expressions. 
Another way is to constrain the polynomials so 
that only cases in which exact division is possi- 
ble are allowed. By ‘exact division of A by B’, 
we mean that 


A=QB+R, 


holds for multi-variate polynomials A, B, Q,
and R with rational number coefficients, and
$\mathrm{degree}_x(R) < \mathrm{degree}_x(B)$. A third possible
solution is to make a formal fraction A/B. This, 
however, is simply arithmetic over rational 
expressions. 
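
For the uni-variate case with rational number coefficients, division with remainder is the familiar long division. The sketch below illustrates it with dense coefficient lists (highest degree first) and Python's `Fraction`; the function name `poly_divmod` and the list layout are assumptions for illustration, not risa's data structures.

```python
from fractions import Fraction


def poly_divmod(a, b):
    """Return (Q, R) with A = Q*B + R and degree(R) < degree(B).

    a and b are coefficient lists, highest degree first; b must have a
    non-zero leading coefficient.
    """
    a = [Fraction(c) for c in a]
    b = [Fraction(c) for c in b]
    if not b or b[0] == 0:
        raise ValueError("divisor must have a non-zero leading coefficient")
    quotient = []
    while len(a) >= len(b):
        factor = a[0] / b[0]          # next quotient coefficient
        quotient.append(factor)
        for i, c in enumerate(b):
            a[i] -= factor * c        # cancel the current leading term
        a.pop(0)
    while a and a[0] == 0:            # strip leading zeros of the remainder
        a.pop(0)
    return quotient, a


# (x^3 - 2x + 5) divided by (2x - 1)
Q, R = poly_divmod([1, 0, -2, 5], [2, -1])
print(Q, R)
```

The quotient coefficients are fractions even though the inputs are integers, which is exactly why applying the same procedure recursively to polynomial coefficients leads out of the polynomial domain.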

Thus, in many ways, dividing a multi-variate
polynomial by another multi-variate
polynomial is not unique. The risa basic
algebraic engine uses the second solution,
namely, exact division is supported. Other
variants of division can be described in programming
languages.

Table 3. Time to compute several powers of polynomials (Unit: ms)

n                   10     20      30      40     50
(a+1)^n             12     21      34       ?      ?
(a+b)^n             18     34      62      90     12
(a+b+c)^n           11     42      92     180    300
(a+b+c+d)^n         45    435    1600    4130   8190
(a+b+c+d+e)^n      160   3860   16400   58700      —

—: Running out of 64 Mbytes of virtual memory
(32 Mbytes of real memory).
?: Illegible in the source.

For rational expressions, arithmetic is easy
as long as reduction of common factors is not required.
It can be done by simply combining addition,
subtraction, and multiplication of polynomials
and making formal fractions from two of them.
Finding the common factor between numerator 
and denominator completely depends on the 
GCD operation. This is described in the follow- 
ing section. Since GCD operations are expensive, 
the risa basic algebraic engine does not perform 
automatic reduction of common factors. 

The computation times for several poly- 
nomial powers are shown in Table 3. 
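
Part of the growth in these times can be understood by counting terms. The sketch below uses the standard combinatorial fact that the expansion of a sum of k indeterminates raised to the n-th power has C(n+k-1, k-1) distinct monomials; the function name `dense_term_count` is an illustrative assumption, and the count says nothing about coefficient sizes, which also affect time and memory.

```python
from math import comb

def dense_term_count(num_terms, n):
    """Number of distinct monomials in the expansion of (t1 + ... + tk)^n,
    where k = num_terms is the number of summands in the base."""
    return comb(n + num_terms - 1, num_terms - 1)

# Bases with 2 to 5 summands, e.g. (a+1) has 2 and (a+b+c+d+e) has 5
for k in (2, 3, 4, 5):
    print(k, [dense_term_count(k, n) for n in (10, 20, 30, 40, 50)])
```

For example, a five-summand base raised to the 40th power already yields 135751 monomials, each carrying a multi-precision coefficient.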


3.3 Higher algebraic operations 

The risa basic algebraic engine supports GCD 
computation and factorization of polynomials 
over the rational number field. GCD computa- 
tion and factorization over finite fields GF(p), 
where p is a prime number, are performed only 
internally and are not currently open for exter- 
nal use. Uni-variate factorization over algebraic 
extension fields of rational numbers is supported 
by an asir library. 

GCD and factorization operations over 
coefficient domains of different types 
depend on each other. Figure 3 shows the 
relationship between algorithms currently used 
in the risa system. 

Fig. 3. Relationship between factoring and GCD 
algorithms in risa system. [Legible labels: Factoring GF(p)[x]; Berlekamp] 

3.3.1 GCD of uni-variate polynomials over the rational number field 

As far as polynomial factors over the rational 
number field are concerned, it is sufficient to 
deal with polynomials with integer coefficients. 
Since a Euclidean algorithm for finding the GCD 
shows a strong intermediate coefficient swell, 
several improvements have been devised. Of these, 
the sub-resultant polynomial remainder sequence 
algorithm (sub-resultant PRS) family*”’ >) and 
the modular GCD algorithm are dominant. Risa 
uses a modular GCD algorithm. Empirically, 
there are some cases where the sub-resultant 
PRS algorithm outperforms the modular 
algorithm. Overall performance, however, of the 
modular GCD algorithm is considerably higher 
than that of the sub-resultant PRS algorithm. 

Modular algorithms are frequently and 
commonly used in calculations of polynomials 
with very large coefficients. 

The essence of a modular algorithm is to 
convert the original problem into its counterpart 
in a coefficient domain in which computation 
can be easily performed. For a GCD computa- 
tion, integer coefficients are projected onto one 
or more finite fields, particularly onto Galois 
field GF(p) for several prime integers p. Then, 
computation of the desired objects (e.g. factors) 
is performed over such simple domains. 
Arithmetic operations on polynomials can be 
performed very quickly in GF(p) without an 
intermediate coefficient swell. Finally, the 
images of the desired objects in the projected 
domain(s) are projected back into their integer 
counterparts by using a Chinese remainder 
algorithm®? or Hensel lifting. 

Details on the Chinese remainder algorithm 
and Hensel lifting can be found in the references 
of this article?) 7)» ®), 
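
The projection step can be illustrated with a plain Euclidean GCD carried out in GF(p). This is a minimal sketch under stated assumptions, not risa's implementation: the names `poly_rem_mod` and `poly_gcd_mod` and the coefficient-list representation (highest degree first) are hypothetical, and the step that maps the image back to integer coefficients (Chinese remaindering or Hensel lifting) is not shown.

```python
def _strip(p):
    """Drop leading zero coefficients (highest degree first)."""
    i = 0
    while i < len(p) and p[i] == 0:
        i += 1
    return p[i:]

def poly_rem_mod(a, b, p):
    """Remainder of a divided by b with coefficients in GF(p); b is non-zero."""
    a = a[:]
    inv_lead = pow(b[0], -1, p)       # modular inverse (Python 3.8+)
    while len(a) >= len(b):
        factor = a[0] * inv_lead % p
        for i, c in enumerate(b):
            a[i] = (a[i] - factor * c) % p
        a.pop(0)                      # the leading term is now zero
    return _strip(a)

def poly_gcd_mod(a, b, p):
    """Monic GCD of a and b over GF(p) by the Euclidean algorithm."""
    a = _strip([c % p for c in a])
    b = _strip([c % p for c in b])
    while b:
        a, b = b, poly_rem_mod(a, b, p)
    if not a:
        return []                     # both inputs vanish modulo p
    inv_lead = pow(a[0], -1, p)
    return [c * inv_lead % p for c in a]

# A = (x + 1)(x + 2), B = (x + 1)(x + 3); image of the GCD modulo 7
print(poly_gcd_mod([1, 3, 2], [1, 4, 3], 7))
```

Every intermediate coefficient stays in the range 0 to p-1, which is why no coefficient swell occurs in the projected domain.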

3.3.2 Factorization of uni-variate polynomials over the integer domain 

Factoring uni-variate polynomials over the 
integer domain is a very attractive problem. 
It is reported*? that the astronomer Friedrich 
von Schubert devised an algorithm to find all 
factors of a uni-variate polynomial over the 
integer domain. It is also reported®) that about 
90 years later L. Kronecker rediscovered von 
Schubert’s algorithm. Unfortunately, this algo- 
rithm is very inefficient when the degree of 
a polynomial is greater than five. 

Modern algorithms for factoring polynomials 
have their origin in E.R. Berlekamp’s paper??, 
which presents a factoring algorithm over finite 
fields. H. Zassenhaus!®) devised an algorithm 
which finds factors over the integer domain. 
Many improved algorithms and subalgorithms 
based on the above have since been proposed. 
Risa implements our own improvements of some 
of these algorithms. 

We will now explain the meaning of the term 
square-free. A polynomial f(x) is square-free if it 
has no multiple factors. 
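
Over the rational numbers, a standard test for this property is that f(x) is square-free exactly when the GCD of f(x) and its derivative is a constant. The sketch below illustrates that test; it is not the Hensel-type procedure risa uses for its first step, and the function names and coefficient-list representation (highest degree first) are assumptions of this example.

```python
from fractions import Fraction

def _strip(p):
    """Drop leading zero coefficients (highest degree first)."""
    i = 0
    while i < len(p) and p[i] == 0:
        i += 1
    return p[i:]

def poly_rem(a, b):
    """Remainder of a divided by b over the rationals; b is non-zero."""
    a = [Fraction(c) for c in a]
    while len(a) >= len(b):
        factor = a[0] / b[0]
        for i, c in enumerate(b):
            a[i] -= factor * c
        a.pop(0)                      # the leading term is now zero
    return _strip(a)

def poly_gcd(a, b):
    """A GCD of a and b over the rationals (not normalised to be monic)."""
    a, b = _strip(list(a)), _strip(list(b))
    while b:
        a, b = b, poly_rem(a, b)
    return a

def derivative(f):
    n = len(f) - 1
    return [c * (n - i) for i, c in enumerate(f[:-1])]

def is_square_free(f):
    """True if the non-zero polynomial f has no repeated factor."""
    return len(poly_gcd(f, derivative(f))) <= 1

print(is_square_free([1, 0, -1]))     # x^2 - 1 = (x - 1)(x + 1)
print(is_square_free([1, 0, -3, 2]))  # x^3 - 3x + 2 = (x - 1)^2 (x + 2)
```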

Factorization of a uni-variate polynomial in 
risa is done in three major steps: 

1) Square-free 

Extract the square-free components of the 
polynomial and their multiplicities so that the 
next steps are effective. 

2) Factoring over GF(p) 

Factor out the given square-free polynomial 
into its irreducible factors over finite field 
GF(p) for several small primes p. 

3) Lifting up and finding the true factors 

Lift the candidate factors, then test each of 
them for a true factor by trial division over the 
integer domain. 

For the first step, risa uses a p-adic Hensel- 
type algorithm, which is mentioned but not 
explicitly described by P.S. Wang and B.M. 
Trager!?). 

The algorithm is based on correct image 
factors over GF(p) for a large prime p. Then, the 
images, i.e. square-free factors over GF(p) for 
prime p, are lifted in parallel using a modified 
Hensel-type algorithm. 

This algorithm makes factorization in the 
risa basic algebraic engine of uni-variate poly- 
nomials over the integer domain very fast. 

The second step follows Berlekamp’s 
algorithm. A problem with this step is that there 
is no assurance of obtaining lucky primes that 
yield correct images over GF(p). So, in general, 
we cannot assure that a factor over GF(p) is a 
correct image of a true factor over the integer 
domain. 

We therefore need a third step to verify 
the correctness. Several combinations of image 
factors have to be tried and lifted until a true 
factor is found. In the worst case, the required 
number of trials of such a verification procedure 
grows exponentially with the degree of 
the polynomial to be factored. There are an 
infinite number of examples where an irreducible 
polynomial over the integer domain splits into at 
most quadratic factors over GF(p) for every 
prime p!”?. 
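
A classical instance, not taken from the source, is x^4 + 1: it is irreducible over the integers, yet it splits into two quadratic factors modulo every prime. The brute-force search below is only a sketch for small primes; the function name `quadratic_split_of_x4_plus_1` is hypothetical.

```python
def quadratic_split_of_x4_plus_1(p):
    """Search for (a, b, c, d) with
    (x^2 + a x + b)(x^2 + c x + d) == x^4 + 1 modulo the prime p.
    Returns the first tuple found, or None."""
    for a in range(p):
        c = (-a) % p                  # the x^3 coefficient a + c must vanish
        for b in range(p):
            for d in range(p):
                if ((b + d + a * c) % p == 0       # x^2 coefficient
                        and (a * d + b * c) % p == 0   # x coefficient
                        and (b * d) % p == 1):         # constant term
                    return a, b, c, d
    return None

for p in (2, 3, 5, 7, 11, 13):
    print(p, quadratic_split_of_x4_plus_1(p))
```

For such a polynomial, no choice of prime yields image factors that correspond to true factors, so the combination step always has work to do.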

A.K. Lenstra, H.W. Lenstra, and L. Lovasz!»? 
presented an algorithm (the basis reduction 
algorithm in a lattice) which factors a uni-variate 
polynomial within polynomial time with 
respect to the degree of the polynomial. The key 
idea is that a lattice point with a small norm 
in the lattice generated from the polynomial 
to be factored corresponds to a true irreducible 
factor. 

Theoretically, the lattice algorithm is cur- 
rently the best algorithm for the worst case. In 
practice, however, the trial-and-verify method is 
faster in most cases. As was shown by Yokoyama 
and Takeshima!”, an extension of the lattice 
algorithm can be applied to certain Euclidean 
rings. Abbott et al!?) employed the algorithm 
to factorize uni-variate polynomials over 
algebraic extension fields, which shows its 
effectiveness in extremely ill-conditioned cases 
(see also Sec. 5.1). 

The lattice algorithm is not currently 
implemented in the risa basic algebraic engine. 

Table 4. Time to factor uni-variate polynomials 
[Table body illegible.] 

Table 5. Time to factor uni-variate polynomials 
(Polynomials are given in Appendix B.1) 
[Column labels illegible.] 
3 490 | 450 | 430 | 650 | 210 

Tables 4 and 5 show the time required to 
factor uni-variate polynomials. 

3.3.3 GCD computation and factorization 
of multi-variate polynomials over the 
integer domain 

For a multi-variate polynomial, the risa basic 
algebraic engine essentially uses an EZ algo- 
rithm by Moses and Yun'®?, and by Wang and 
Rothschild'® that has been improved by 
M. Noro et al.'”). The improved algorithm can 
be regarded as a simplified version of Wang’s 
EEZ algorithm'®. 

PRS algorithms are also applicable for mul- 
tivariate GCD computation; see Hearn!”?, 
Brown”, and Suzuki and Sasaki?!. Modular 
algorithms of the Chinese remainder type, e.g. 
Brown®) and Collins?”), are more suitable than 
PRS algorithms, especially for dense multi-variate 
polynomials. On the other hand, Hensel type 
(EZ and EEZ) algorithms are considered best for 
sparse multi-variate polynomials. Since, in 
practice, most multi-variate polynomials dealt 
with are sparse, Hensel type algorithms are most 
practical. 

The EZ algorithm reduces the multi-variate 
polynomial problem of GCD and factorization 
into a uni-variate polynomial problem by sub- 
stituting indeterminates with integers. This 
substitution is called evaluation, and the sub- 
stituted integer value is called an evaluation 
point. The evaluation can only be done under 
several constraints. One important constraint 
is that the evaluation must preserve the square-free 
property of the given multi-variate polynomial. 
The true multi-variate 
factors are reconstructed by an extended 
Zassenhaus algorithm (i.e. an extension of p-adic 
Hensel lifting). 
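
As a small illustration of the square-free constraint, consider evaluating y in f(x, y) = x^2 - y. The polynomial is a hypothetical example chosen for this sketch, not one taken from the source.

```latex
\begin{align*}
f(x, y) &= x^2 - y
  && \text{square-free (indeed irreducible) in } \mathbb{Z}[x, y],\\
f(x, 0) &= x^2
  && \text{not square-free, so } y = 0 \text{ cannot be used},\\
f(x, 1) &= x^2 - 1 = (x - 1)(x + 1)
  && \text{square-free, so } y = 1 \text{ meets this constraint}.
\end{align*}
```

At y = 1 the image splits into two factors although f itself is irreducible, so neither image factor lifts to a true factor. This is a simple instance of extraneous image factors.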

Using the EZ algorithm for multivariate 
GCD and factorization has three major problems: 
the leading coefficient, extraneous image factor, 
and bad zero substitution. 

The enhanced EZ algorithm (EEZ algorithm) 
offers practical improvements in these areas. The 
main improvement is predetermination of the 
leading coefficients of the factors. Also, because 
the improved p-adic construction procedure 
(parallel lifting of all factors) conserves the 
sparsity of multi-variate polynomials, the bad 
zero substitution problem is solved. This solu- 
tion considerably reduces the extraneous factor 
problem by using several different sets of 
integers to factor a uni-variate polynomial over 
the integer domain. The existence of an evalua- 
tion which meets the constraints of the 
algorithm has not been proved. However, such 
an unlucky case is considered to be very rare, 
and the theoretical drawback does not affect the 
effectiveness of the EEZ algorithm. 

The risa basic algebraic engine does not yet 
support the EEZ algorithm, partly because 
the algorithm is extremely complicated and 
requires very careful coding. Another reason 
is that we have devised an improved EZ 
algorithm that can be regarded as a simplified 
EEZ algorithm. This improved algorithm is 
briefly explained below. 

When a uni-variate factor and its co-factor 
are lifted using the EZ algorithm, their leading 
coefficients have to be tentatively predeter- 
mined. The simplest way to do this is to provide 
the leading coefficient of the given polynomial. 
However, this method is very inefficient because 
the number of lifting steps increases if the 
total degree of the leading coefficient is large. 
As for GCD, an estimate of the leading coefficient 
of the GCD can be obtained by computing 
the GCD of the leading coefficients of the two 
polynomials. Obviously, this method cannot be 
applied to factorization. The EEZ method completely 
predetermines the leading coefficients 
from the factors of the leading coefficient. Our 
technique is to evaluate the factors of the leading 
coefficient in order to estimate the leading 
coefficients of factors. In more detail, evaluation 
points are chosen so that the factors of the 
leading coefficient at the evaluation points do 
not divide each other. The leading coefficients 
of factors can be estimated by counting the 
mutual divisibilities of the integers obtained 
by the evaluation. 

Table 6. Time to factor multi-variate polynomials 
(Polynomials are given in Appendix B.2) 

Polynomial | m1  | m2  | m3  | m4    | m5        | m6    | m7  | m8 
Time (ms)  | 170 | 130 | 190 | 2 260 | 1 400 (?) | 3 550 | 110 | 16 700 

Polynomial | m9    | m10   | m11   | m12 | m13   | m14 | m15 
Time (ms)  | 7 790 | 4 020 | 7 380 | 20  | 1 350 | 480 | 450 

Table 7. Time to factor multi-variate polynomials 
(Polynomials are given in Appendix B.2) 

Polynomial | [labels illegible] 
Time (ms)  | 10 | 20 | 30 | 250 | 290 | 370 | 1 140 | 2 310 | 14 840 

Table 8. Time to factor multi-variate polynomials 
(Polynomials are given in Appendix B.2) 

Polynomial | s1 | s2  | s3  | s4    | s5    | s6 
Time (ms)  | 40 | 310 | 180 | 2 000 | 4 240 | 6 690 

The existence of such an evaluation that 
meets the above condition and also preserves 
the square-free property of the given square-free 
polynomial has not been proved. Therefore, 
our technique is as heuristic as the EEZ 
algorithm. 

Examples of times required to factorize 
multi-variate polynomials over the integer 
domain are shown in Tables 6 to 8. 


4. Asir — a programming language 

The separation of programming languages 
from computation engines is a consequence of 
our independent open module strategy. We have 
already implemented two language prototypes: 
asir and risal. Risal has more data types than asir 
and is implemented without regard to efficiency. 
Asir allows limited data types and its interpreter/evaluator 
is more efficient. Asir’s syntax 
is essentially that of the C language. 

Our experiences in developing two language 
prototypes have shown the advantages of 
separating engines and languages. It will not be 
difficult to implement programming languages 
having a syntax similar to such languages as 
FORTRAN, Algol, Pascal, and BASIC according 
to the users’ preference. 

In the following section we describe the asir 
language, which we regard as a standard language 
prototype. (We will enhance and improve asir in 
the future.) 


4.1 Overview of asir 
Asir was developed as a simple but efficient 
user programming language to interface between 
users and the risa basic algebraic engine. Higher 
level programming features are studied by 
developing risal, asir’s elder brother. Asir was 
carefully designed so that it will not be subject 
to major changes in its programming features in 
the future. In many ways, this careful but rather 
awkward strategy has made asir immature. How- 
ever, asir is still under development and is grow- 
ing steadily. 

Asir’s interpreter/evaluator is written in C 
language. The use of a C preprocessor in the 
parsing process of the asir interpreter makes it 
easier to define macros, aliases, etc. Asir’s 
parser generates a program tree that can be sub- 
sequently interpreted and evaluated. Asir’s inter- 
preter/evaluator interprets programs and 
dispatches primitive jobs to the risa basic 
algebraic engine. A list of indeterminates that 
sometimes includes kernels is sent to the engine 
together with the data. This list shows the 
ordering of indeterminates in the data. 

Asir Version 0 supports the following func- 
tions: 

1) Arithmetic operations on integers, rational 
numbers, polynomials, and rational expres- 
sions 

2) Primitive operations for vectors and matrices 

3) Minimum list processing features 

4) A C-like programming language 
5) Several built-in functions/operations 
6) A dbx-like debugger. 


4.2 Syntax of asir 
The asir syntax is based on C language. The 
major differences between the two are explained 
below. 

Limitations 

1) Asir has no types for program variables or 
indeterminates; therefore, types cannot 
currently be defined. 

2) Program variables declared in a function are 
local to the function by default. If program 
variables in a function are declared ‘external’, 
they are global at the top level. 

3) A program variable begins with a capital 
letter. Indeterminates and functions begin 
with a lower-case letter. 

4) ‘switch statements’ and ‘goto statements’ are 
not allowed. 

5) ‘Comma expressions’ are allowed for A, B, 
and C in ‘for (A; B; C)’ or for A in ‘while (A)’. 

Extensions 

1) Computations on rational expressions and 
polynomials are supported. 

2) Lists can be processed. 


4.3 Compound data types and operations 

4.3.1 Functors and kernels 

Functors are function symbols of kernels 
(monomial function forms). For example, 
kernels sin(x) and f(xy, x — y) have functors 
sin and f respectively. Syntactically, functors 
are indistinguishable from ordinary functions, 
but they remain in the results after evaluation, 
whereas ordinary functions do not. In usual 
arithmetic, kernels are treated as unstructured 
objects; therefore, the risa basic algebraic 
engine does not recognize their structures. For 
example, sin(x) is treated as a single indeter- 
minate independent and distinct from any other 
indeterminates and kernels. Typical operations 
that recognize the structural composition of 
kernels are differentiation and integration. 
These operations depend upon the actual 
arguments of a kernel as well as its functor. 
At present, operations for functors and kernels 
are only experimental and are insufficient to 
support computations by rewriting rules. 

4.3.2 Lists 

Lists are supported in asir. A list having 
elements a_1, a_2, ..., a_n is denoted in asir by 
[a_1, a_2, ..., a_n]. A null list (list having no 
member) is denoted by [ ]. 

Five primitive operations on lists are 
supported: 

1) [a_1, a_2, ..., a_n]: generates a list. 

2) car(L): gives the first element of non-null list L. 

3) cdr(L): gives the list obtained by removing 
the first element of non-null list L. 

4) cons(H, T): gives a new list L such that 
car(L) = H and cdr(L) = T. 

5) ‘==’: equality test for lists. 
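
The relationship between these primitives can be modelled in a few lines. The sketch below is written in Python rather than asir and only mirrors the semantics stated above; it makes no claim about how asir represents lists internally.

```python
def car(lst):
    """First element of a non-null list."""
    if not lst:
        raise ValueError("car of a null list")
    return lst[0]

def cdr(lst):
    """The list without its first element; the argument must be non-null."""
    if not lst:
        raise ValueError("cdr of a null list")
    return lst[1:]

def cons(head, tail):
    """New list L with car(L) == head and cdr(L) == tail."""
    return [head] + tail

L = cons(1, [2, 3])
print(L, car(L), cdr(L), L == [1, 2, 3])
```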

4.3.3 Vectors and matrices 

Vectors and matrices are supported in asir 
with their primitive operations, i.e. addition, subtraction, 
multiplication, scalar multiplication, 
and scalar division. Exponentiation by a positive 
integer is allowed for square matrices. Consequently, 
operator symbols +, -, *, /, and ^ are 
polymorphic operators. Such polymorphic 
operators are interpreted according to the data 
types of their operands at evaluation. 

More complicated operations, such as inver- 
sion of matrices, are not currently supported. 
Because these more complicated operations 
tend to explode unless carefully designed 
algorithms are used, they will be supplied by 
libraries. 

Asir does not currently support arrays. 
Elements of a vector or matrix can be of any 
type. This somewhat alleviates the incon- 
venience caused by the absence of arrays. 

An element designator of a vector or matrix 
is treated as one programming variable. There- 
fore, it can appear at the left side of assignment 
statements. 
4.4 Built-in functions 

This section explains several built-in functions that are not mentioned elsewhere. 

The meanings of arguments in this section are as follows: 

obj : arbitrary 
rat, rat_i : rational expression 
poly, poly_i, defpoly : polynomial 
var : indeterminate 
list : list 
vect : vector 
mat : matrix 
int, deg, len : integer. 

1) Manipulation of objects 

var(rat) : main-variable of rat 
vars(rat) : list of indeterminates contained in rat 
red(rat) : reduced rational expression of rat 
nm(rat) : numerator of rat 
dn(rat) : denominator of rat 
coef(poly, deg, var) : coefficient of poly at degree deg with respect to var 
deg(poly, var) : highest exponent of poly with respect to var 
size(vect), size(mat) : [length of vect], [row size of mat, column size of mat] 
type(obj) : type classifier of obj 

2) Mathematical operations 

fac(int) : factorial of int 
gcd(poly_1, poly_2) : GCD of poly_1 and poly_2 
agcd(defpoly, poly_1, poly_2) : GCD of poly_1 and poly_2 over an algebraic extension field with its defining polynomial defpoly 
subst(rat_1, [var, rat_2]*) : substitute rat_2 for var in rat_1; left to right if repeated 
res(var, poly_1, poly_2) : resultant of poly_1 and poly_2 with respect to var 
fctr(poly) : factorize poly; result is a list of form [[multiplicity_1, factor_1], ...] 
afctr(defpoly, poly) : factorize poly over algebraic extension field with defining polynomial defpoly 
diff(rat, var) : differentiate rat by var 
int(rat, var) : integrate rat by var; loading integration library is necessary 

3) Special divisions 

sdiv(poly_1, poly_2) : exact division giving quotient 
srem(poly_1, poly_2) : exact division giving remainder 
idiv(int_1, int_2) : integer quotient 
irem(int_1, int_2) : integer remainder 

4) Miscellaneous 

newvect(len) : generate a new vector with length len 
newmat(row, col) : generate a new matrix with row size row and column size col 
print(obj) : display obj followed by a newline 
cputime(0 or 1) : enter displaying mode of CPU-time if argument is 1; exit if argument is 0 
debug : enter ‘debugging mode’ 
load("filename") : read programs from file filename; object files (relocatable object codes generated by C compiler) can also be read 
end, quit : close the current reading file; close asir session at the top level 


4.5 User defined functions 

The user can define functions by using the 
‘def’ statement. Many syntax errors may be detected 
at parsing. These errors are indicated 
by their approximate position in the reading file. 
A function is distinguished by its name regardless 
of the number of arguments it contains. If 
functions with the same name are defined more than 
once, only the last one is valid for that name. 

Appendices A.1 and A.2 show example 
programs and the timing data. 

4.6 Asir debugger 

Asir has its own dbx-like debugger (see Note). To 
enter debugging mode, type in ‘debug;’ at the 
top level. Asir also enters debugging mode when 
it reaches a break point, or when the ‘d’ option 
is selected at interruption. 

The debugger has a minimum set of dbx 
commands. Their functions are mostly similar to 
those of dbx commands. Execution of asir 
statements is controlled by several commands, 
some of which are shown below. 

1) ‘step’ and ‘next’ execute the next statement. 
2) ‘stop at’ sets a break point on the source line. 
3) ‘delete’ removes a break point. 
4) ‘print’ displays the value of an expression. 
5) ‘cont’ lets asir exit from debugging mode and continue asir evaluation. 
6) ‘list’ displays source lines. 
7) ‘quit’ lets asir exit from debugging mode. 

Note: dbx is a standard symbolic debugger of BSD UNIX. 

5. Libraries 

5.1 Factorization over algebraic extension fields 

Factorization using norm is experimentally 
implemented in asir. Factorization was first 
written in C language as a sub-module of the 
risa basic algebraic engine. Because of our policy, 
this type of non-general routine was rewritten in 
asir language. No reduction in efficiency was 
found after rewriting. 

Some literature states that the norm method 
is inefficient compared to methods that use 
Hensel lifting. Abbott et al!) reported several 
exceptionally ill-conditioned cases in which 
lattice methods seem to be the only practical 
solution. Our experience shows that norm 
methods using our improvements are practicable 
not only for usual cases but also for some 
ill-conditioned cases. 
In the discussion below, we will use the 
following notation: Z for the set of integers; 
Q for the set of rational numbers. 
Let α be algebraic over Q with its minimal 
polynomial g(t). Let Q(α) denote the algebraic 
extension field of Q obtained by adjoining α to Q. The 
problem is to factorize f(x) ∈ Q(α)[x] over Q(α). 
The original algorithm does the following: 

Step 1: Let r(x) = resultant_t(f(x - st), g(t)) for 
an integer s. Search for an s for which 
r(x) is square-free. 

Step 2: Factorize the square-free r(x) over Q. Let 
h(x) be one of the irreducible factors. 

Step 3: Let k(x) = GCD(h(x + sα), f(x)) over 
Q(α); then k(x) is an irreducible factor 
of f(x) over Q(α). Here, we can use any 
algorithm to compute a uni-variate GCD 
over Q(α). 

The polynomial r(x) in Step 1 is identical 
to what we call the norm of f(x - sα) from 
Q(α)[x] onto Q[x]. 
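
The three steps can be followed by hand on a small case. The worked example below is an illustration chosen for this purpose and is not one of the benchmark polynomials; it takes f(x) = x^2 - 2 and α = √2, so that g(t) = t^2 - 2, and computes the resultant as the product of f(x - st) over the two roots of g.

```latex
\begin{align*}
\text{Step 1, } s = 1:\quad
  r(x) &= \bigl((x-\sqrt2)^2 - 2\bigr)\bigl((x+\sqrt2)^2 - 2\bigr)
        = x^4 - 8x^2 = x^2\,(x^2 - 8)
        && \text{(not square-free)}\\
\text{Step 1, } s = 2:\quad
  r(x) &= \bigl((x-2\sqrt2)^2 - 2\bigr)\bigl((x+2\sqrt2)^2 - 2\bigr)
        = x^4 - 20x^2 + 36
        && \text{(square-free)}\\
\text{Step 2:}\quad
  r(x) &= (x^2 - 2)(x^2 - 18) \quad\text{over } \mathbb{Q}\\
\text{Step 3, } h = x^2 - 2:\quad
  h(x + 2\sqrt2) &= x^2 + 4\sqrt2\,x + 6 = (x + \sqrt2)(x + 3\sqrt2),
  & \gcd\bigl(h(x + 2\sqrt2),\, f\bigr) &= x + \sqrt2\\
\text{Step 3, } h = x^2 - 18:\quad
  h(x + 2\sqrt2) &= x^2 + 4\sqrt2\,x - 10 = (x - \sqrt2)(x + 5\sqrt2),
  & \gcd\bigl(h(x + 2\sqrt2),\, f\bigr) &= x - \sqrt2
\end{align*}
```

The two GCDs recover f(x) = (x + √2)(x - √2) over Q(√2). The case s = 1 shows why Step 1 may have to try more than one value of s.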

Our improvements are as follows: 

1) Computation of the resultant in the original 
algorithm, Step 1, takes considerable time. 
Therefore, let us consider making use of a 
non-square-free r(x). r(x) contains all factors 
of f(x - sα). Therefore, in the original algorithm 
steps, any factor of such an r(x) produces 
a non-trivial factor of f(x). Although 
the obtained factors are not guaranteed to 
be irreducible over Q(α), a reduction in the 
size of the problem can be expected. 

2) Further observation tells us that for factors 
of single multiplicity, the GCD in Step 3 is 
irreducible. Thus, r(x) can be used regardless 
of whether it is square-free. 

3) A candidate factor of uni-variate factorization 
in Step 2 is determined to be extraneous 
if its degree is not a multiple of the degree of 
g(t). This is because an irreducible factor, 
say h(x), of a square-free part of r(x) over 
Q is a product of all conjugates of an irreducible 
factor of f(x - sα) over Q(α). 
Therefore, the degree of h(x) must be 
divisible by the degree of g(t). 

Table 9. Time to factor uni-variate polynomials f(x) over 
algebraic extension fields Q(α), where α is a 
root of each f(x) to be factored 
(Polynomials are given in Appendix B.3) 

Polynomial | a1 | a2 | a3  | a4    | a5     | a6    | a7 | a8 
Time (ms)  | 70 | 80 | 470 | 4 100 | 22 430 | 3 040 | 60 | 20 

These improvements enable the efficient 
factorization of many problems that are extreme- 
ly ill-conditioned for the original algorithm. The 
timing data is shown in Table 9. 


5.2 Indefinite integration 

An incomplete version of indefinite integra- 
tion of some limited elementary functions 
is experimentally implemented in C language. 
This could be implemented in asir in the same 
way that the routine for uni-variate factoriza- 
tion over algebraic extension fields was imple- 
mented. The rewriting of this implementation 
in asir, and the improvement of its applicability 
to a wider range of elementary functions are left 
for the future. 

For rational functions, Rothstein’s 
algorithm23) (some literature refers to it as 
Trager-Rothstein’s algorithm) is employed. The 
integration of rational functions involves two 
steps: computation of the rational part of the 
integral, and computation of the logarithmic 
part of the integral. 

Rothstein’s algorithm and Trager’s 
algorithm?* find the rational part in the same 
way. But to compute the logarithmic part, 
the former only needs to perform GCD com- 
putations on several algebraic extension fields, 
whereas the latter must compute a sufficiently 
large extension field of the integrand’s 
denominator (in the worst case, the splitting 
field). Computing the splitting field of a poly- 
nomial is very hard if the degree of the poly- 
nomial is five or more. Thus, Rothstein’s 
algorithm is much more efficient than Trager’s 
algorithm. 
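
The logarithmic part can be illustrated with the usual textbook formulation of the resultant-based method, which is assumed here because the source gives no formulas. For a proper fraction A/D with D square-free, one forms R(z) = resultant_x(A - zD', D); each root c of R(z) contributes c log GCD(D, A - cD'). The integrand 1/(x^2 - 2) below is a hypothetical example.

```latex
\begin{align*}
A &= 1, \qquad D = x^2 - 2, \qquad D' = 2x\\
R(z) &= \operatorname{res}_x\bigl(1 - 2zx,\; x^2 - 2\bigr)
      = (1 - 2\sqrt2\,z)(1 + 2\sqrt2\,z) = 1 - 8z^2,
      \qquad c = \pm\frac{\sqrt2}{4}\\
\gcd\Bigl(D,\; 1 - \tfrac{\sqrt2}{2}\,x\Bigr) &= x - \sqrt2,
\qquad
\gcd\Bigl(D,\; 1 + \tfrac{\sqrt2}{2}\,x\Bigr) = x + \sqrt2\\
\int \frac{dx}{x^2 - 2}
  &= \frac{\sqrt2}{4}\,\log(x - \sqrt2) - \frac{\sqrt2}{4}\,\log(x + \sqrt2)
\end{align*}
```

Only GCDs over Q(c) are needed, where c is a root of R(z). The splitting field of the denominator is never constructed explicitly, which is the advantage described above.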


For transcendental extensions of rational 
functions, the so-called parallel Risch algorithm 
is implemented for logarithmic extension and 
exponential extension. The implementation is 
incomplete in many ways. This is because only 
two transcendentals, log and exp, are supported; 
and because denominators are factorized only 
over the rational numbers and not in algebraic 
closures. (This is true for almost all computer 
algebra systems because multi-variate factoriza- 
tion over algebraically closed fields is highly 
expensive.) Also, no check is made regarding 
conformity to monomial extensions, which 
is a necessary condition for the algorithms. 

Integration of algebraically extended functions 
is not currently implemented. Thus, 
our implementation is only experimental, and in 
many ways is incomplete. Enhancement of our 
implementation is left for the future. 


6. Other features 
6.1 Floating-point number coefficients 
Floating-point number coefficients are ex- 
perimentally implemented in the risa basic 
algebraic engine. Asir supports the writing 
of floating-point number constants in asir 
programs. Many computer algebra systems 
support arbitrary precision floating-point 
numbers, the so-called big float. In risa, how- 
ever, only double precision floating-point 
numbers are available. Arithmetic operations 
between floating-point and rational numbers 
result in a floating-point number. Other operations 
that involve floating-point coefficients are not defined 
in risa. Factorization, of course, cannot be 
well-defined in a normal sense. GCD and resultant 
operations can apparently be defined. Doing 
such algebraic operations using approximate 
numbers (i.e. floating numbers) can easily 
produce incorrect results. Recently, approxi- 
mate algebraic computations’ have been pro- 
posed by Sasaki and Noda?®?, Sasaki and 
Sasaki?®), and Noda et al.?” that partly show 
how to extend algebraic operations and algo- 
rithms to numerical computation. 
6.2 Graph plotting 

By using the X11 (see Note) interface technology, 
a small sample program for drawing implicit 
curves in two-dimensional space was recently 
incorporated into the risa basic algebraic engine. 
Numeric evaluation of a given polynomial in two 
indeterminates can be performed by several 
different CPUs. To specify such a distributed 
computation, several primitive functions are 
supported as asir functions. Displaying windows, 
of course, can be specified, and this will be 
enhanced into a practical graph plotting module. 


7. Theoretical contributions 

This chapter briefly describes our theoretical 
contributions to computer algebra research. The 
results of these theoretical contributions will 
form the foundation of future enhanced risa 
systems. 

The first section discusses our studies on 
polynomial factorization and shows several new 
methods of factorization for several domains. 


7.1 Factoring uni-variate polynomials over finite 
fields” ®) 

This section describes our work on the 
factorization of uni-variate polynomials over 
finite fields. To factor a given polynomial f(x) 
over GF(q), we enhanced the following sub- 
algorithms of Berlekamp’s factorization algo- 
rithm for uni-variate polynomials over finite 
fields: distinct degree decomposition and the 
finding of non-trivial factors. 

We introduced three new sub-algorithms: 

1) determination of distinct degree factors in 
advance by using the concept of p-factors, 
2) a test of separability for an arbitrary non-trivial 
element g(x) in Kernel(Π - I), where 
Π is the Frobenius mapping on GF(q)[x]/(f(x)) 
and I is the identity mapping, and 
3) setting an alternative for ‘finding roots’, that 
is, ‘finding non-trivial factors of G(x)’, where G(x) is 
the minimal polynomial of g, instead of 
‘finding roots of G(x)’. 

Note: X Window System, Version 11; The X Window System is a network transparent window system developed at MIT. 
It runs on a wide range of computing and graphics machines. 

The first sub-algorithm speeds up the GCD 
computation because the degree of the resulting 
factor is known in advance. The test of separability 
determines how many irreducible factors 
are contained in GCD(f, g — s), where s is a root 
of G(x). In addition, this test gives non-trivial 
factors without the need to use the ‘finding 
roots’ procedure even if GCD(f, g —s) fails to 
produce an irreducible factor. 


7.2 Factoring multi-variate polynomials over 
algebraic extension fields”’»>” 

Kaltofen devised a way to find an extension 
field over which an absolutely irreducible factor 
can be obtained?!»3”), He employed an analytic 
argument to prove his answer. We developed an 
alternative algebraic proof of Kaltofen’s answer, 
and used the proof to devise several extensions 
to his answer and to investigate the properties of 
absolutely irreducible factors. We also showed 
how our results can be used to computate 
absolutely irreducible factors. {| Independently, 
Chistov and Grigor’ev also gave a similar alter- 
native proof of Kaltofen’s answer**?.| 

In connection with our work on the 
factorization of polynomials, we also looked for 
a way to represent extension fields and to 
determine the solvability of polynomials. 


7.3 Representation of extension fields, primitive
elements

Computer algebra often involves algebraic
extension fields, especially in the problems on
polynomial factorization that were discussed
in the previous two sections. We presented
several mathematical results and new
computational methods for primitive elements and
their minimal polynomials. In more detail,
for a field $Q(\alpha_1, \ldots, \alpha_t)$ obtained by
adjoining algebraic numbers $\alpha_1, \ldots, \alpha_t$ to the
rational number field $Q$, we showed that there is
at least one vector $s = (s_1, \ldots, s_t)$ of integers in
distinct $tN$ integers such that

$$s_1\alpha_1 + s_2\alpha_2 + \cdots + s_t\alpha_t$$

is a primitive element, where $N$ is the degree of
$Q(\alpha_1, \ldots, \alpha_t)$ over $Q$. Moreover we presented
a method of directly calculating a vector $s$
such that

$$s_1\alpha_1 + s_2\alpha_2 + \cdots + s_t\alpha_t$$

is a primitive element. For a given polynomial
f over Q, we presented methods of computing
a primitive element of the splitting field of f
over Q. We also presented methods of computing
a primitive element of the splitting field of
f and its minimal polynomial over Q.
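
A small worked example may make the statement concrete. It is a standard textbook case chosen for illustration and does not come from the paper: take $t = 2$, $\alpha_1 = \sqrt{2}$, $\alpha_2 = \sqrt{3}$, so that $N = [Q(\sqrt{2}, \sqrt{3}) : Q] = 4$, and try the vector $s = (1, 1)$.

$$
\begin{aligned}
\theta &= \sqrt{2} + \sqrt{3}, \\
\theta^2 &= 5 + 2\sqrt{6}, \\
(\theta^2 - 5)^2 &= 24 \quad\Longrightarrow\quad \theta^4 - 10\,\theta^2 + 1 = 0 .
\end{aligned}
$$

The polynomial $x^4 - 10x^2 + 1$ is irreducible over $Q$, so $\theta$ has degree $4 = N$ and is therefore a primitive element: $Q(\theta) = Q(\sqrt{2}, \sqrt{3})$.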


7.4 Solvability of polynomials

Landau and Miller presented a method of
determining the solvability of a monic irreducible 
polynomial over the integer domain in poly- 
nomial time. In their method, a series of poly- 
nomials is constructed so that the original 
problem is reduced to determining the solvability 
of new polynomials. We presented a more 
efficient method of finding such a series of 
polynomials. More precisely, we introduced the 
new notion of a series of blocks in the set of all 
roots of the original polynomial under the ac- 
tion of its Galois group. We then presented an 
efficient method of finding such a series of 
blocks by modifying Landau and Miller’s 
method of finding minimal non-primitive blocks. 

As for higher arithmetics, we studied how to
deal with polynomial ideals and their related
problems, for example, computing the Gröbner
basis and solving polynomial equations using
the Gröbner basis.


7.5 Modular Gröbner basis

First we will describe our work on computing
the Gröbner basis. Sasaki and one of the
authors proposed a new method of constructing
the Gröbner basis of a polynomial ideal over $Q$
that uses a modular approach. Given a finite set
of polynomials in $Z[x_1, \ldots, x_n]$, this method
calculates the Gröbner bases over $Z/(p_i)$,
$i = 1, \ldots, k$, where $p_1, \ldots, p_k$ are distinct primes.
Then, it constructs the Gröbner basis over $Q$ by
using the Chinese remainder algorithm and
conversion of integers to rationals. We therefore
called the method the modular Gröbner method
or modular Buchberger algorithm. By avoiding
intermediate coefficient growth, this method
enables the efficient calculation of a Gröbner
basis with large-sized coefficients. We proposed
two algorithms. One is simple but probabilistic
in that, with an extremely small probability, it
may give a wrong basis such that
ideal(wrong basis) $\supset$ ideal(true basis). The other algorithm
is less simple but gives the correct basis. (As
a related topic, solving a system of algebraic
equations using the modular Gröbner method is
also discussed.)
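
The last two steps of the modular approach, the Chinese remainder algorithm and the conversion of integers to rationals, can be sketched for a single coefficient as follows. This is a minimal Python illustration and not the authors' implementation: the function names are hypothetical, the Gröbner basis computation itself is not shown, and the rational reconstruction uses the usual extended-Euclid approach with the bound sqrt(m/2).

```python
from math import gcd, isqrt

def crt_pair(r1, m1, r2, m2):
    """Combine x = r1 (mod m1) and x = r2 (mod m2) for coprime m1, m2."""
    t = ((r2 - r1) * pow(m1, -1, m2)) % m2
    return (r1 + m1 * t) % (m1 * m2), m1 * m2

def rational_reconstruct(a, m):
    """Return (n, d) with n = a*d (mod m) and |n|, d <= sqrt(m/2), or None."""
    bound = isqrt(m // 2)
    r0, r1 = m, a % m
    t0, t1 = 0, 1
    # Extended Euclid; the invariant r_i = t_i * a (mod m) holds throughout.
    while r1 > bound:
        q = r0 // r1
        r0, r1 = r1, r0 - q * r1
        t0, t1 = t1, t0 - q * t1
    if t1 == 0 or abs(t1) > bound or gcd(r1, abs(t1)) != 1:
        return None
    return (r1, t1) if t1 > 0 else (-r1, -t1)

# Suppose one coefficient of the true basis is -2/3 (an assumed example).
# Its images modulo three small primes play the role of the modular results.
primes = [101, 103, 107]
images = [(-2 * pow(3, -1, p)) % p for p in primes]

residue, modulus = images[0], primes[0]
for r, p in zip(images[1:], primes[1:]):
    residue, modulus = crt_pair(residue, modulus, r, p)

print(rational_reconstruct(residue, modulus))
```

If the product of the primes is too small for the true numerator and denominator, the reconstruction either fails or returns a wrong fraction, which is one way a simple probabilistic variant can go wrong.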

Next, we will mention our work on solving 
polynomial equations. 

7.6 Solving polynomial equations

We presented some new mathematical 
results and proposed several new algorithms 
for solving a system of algebraic equations 
algebraically. In this work, we found that many 
ideal theoretical arguments for the problem can 
be translated into their counterparts in the 
theory of linear maps. This translation provided 
straightforward descriptions of the U-resultant 
and the forms of solutions of systems. Our new 
algorithms apply algorithms for linear algebra
to avoid repeated computations of Gröbner
bases under lexicographic order. In principle,
these algorithms require only one computation
of a Gröbner basis under an arbitrary order.


Appendices

A. Sample programs

A.1 Programs for calculating factorials

1) /* A recursive program for factorials. */
   def recf(X) { return !X ? 1 : X * recf(X-1); }
   (see Table A-1.)

2) /* A repetitive program for factorials. */
   def f(X) {
       for (I = 1, M = 1; I <= X; I++) M *= I;
       return M;
   }
   (see Table A-2.)


Table A-1. Time to compute n! using the above recursive
program, recf(n), for several values of n

n         | 10 | 100 | 1 000 | 100 000
Time (ms) |  5 |  30 | 1 150 | 126 520


Table A-2. Time to compute n! using the above repetitive
program, f(n), for several values of n

n         | 10 | 100 | 1 000 | 100 000
Time (ms) |  5 |  25 |   940 | 105 820


8. Conclusion 

This paper discussed some of the results of
research in computer algebra. A small but
rigorous computer algebra system, risa, was
introduced. Risa is envisioned as an indispensable
component of future scientific computation
systems. It is mainly intended for professional
use in very large and time consuming computations,
but is also usable by non-professionals.
Risa is based on the mathematical foundations
of computer algebra and the independent open
module structure. As a general purpose computer
algebra system, risa’s operating speed is
competitive and in some cases outstanding.

The next main target of our research is to
connect risa to a numeric package, for example,
Fujitsu’s SSL II. A Gröbner-basis library is also
included in our plans. The numeric/algebraic
hybridization of algorithmic levels is a challenging
and valuable research theme.


A.2 A program for calculating π
1) def at1(M, D) {
       for (N = 1, SGN = 1, MM = M*M, A = 0,
            XN = idiv(D, M); XN;
            N += 2, XN = idiv(XN, MM), SGN *= -1)
           A += (SGN*idiv(XN, N));
       return A;
   }

   def pi(D) {
       Y = 10^D; X = 16*at1(5, Y) - 4*at1(239, Y);
       return X;
   }

(see Table A-3.)


Table A-3. Time to compute first n digits of π using the
above program, pi(n), for several values of n

n         | 10 | 100 | 1 000 | 100 000
Time (ms) | 10 |  50 | 2 065 | 185 835
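
For readers without access to risa, the program above can be transliterated into Python. This is a sketch under the assumption that `idiv` is integer division on non-negative operands; the name `machin_pi` is chosen here to avoid clashing with `math.pi`. It evaluates Machin's formula π = 16 arctan(1/5) − 4 arctan(1/239) in scaled integer arithmetic, so the last few digits of the result carry truncation error.

```python
def at1(m, d):
    """Integer approximation of d * arctan(1/m) from the alternating series."""
    n, sgn, mm, acc = 1, 1, m * m, 0
    xn = d // m                     # d / m^1
    while xn:
        acc += sgn * (xn // n)      # add or subtract d / (n * m^n)
        n += 2
        xn //= mm                   # next odd power of 1/m
        sgn = -sgn
    return acc

def machin_pi(digits):
    """Return roughly floor(pi * 10**digits) using Machin's formula."""
    y = 10 ** digits
    return 16 * at1(5, y) - 4 * at1(239, y)

print(machin_pi(30))
```

Requesting a few guard digits and discarding them afterwards is the usual way to absorb the truncation error.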
B. Data for timing table
B.1 Data for uni-variate factorization 


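$$
\begin{aligned}
u_1 = {} & (8192x^{10} + 20480x^{9} + 58368x^{8} - 161792x^{7} + 198656x^{6} + 199680x^{5} - 414848x^{4} \\
 & \quad - 4160x^{3} + 171816x^{2} - 48556x + 469) \\
 & \times (8192x^{10} + 12288x^{9} + 66560x^{8} - 22528x^{7} - 138240x^{6} + 572928x^{5} - 90496x^{4} \\
 & \quad - 356032x^{3} + 113032x^{2} + 23420x - 8179) \\
 & \times (4096x^{10} + 8192x^{9} + 1600x^{8} - 20608x^{7} + 20032x^{6} + 87360x^{5} - 105904x^{4} \\
 & \quad + 18544x^{3} + 11888x^{2} - 3416x + 1) \\
 & \times (4096x^{10} + 8192x^{9} - 3008x^{8} - 30848x^{7} + 21056x^{6} + 146496x^{5} - 221360x^{4} \\
 & \quad + 1232x^{3} + 144464x^{2} - 78488x + 11993),
\end{aligned}
$$
[$u_1$: from SIGSAM Problem 7]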

lig =X = 25x — 3500x5 —575000 + 21875x* — 3125. 

us =x*® 49x17 +.45x1© + 126x'5 + 189x1* +27x!3 — 540x!? — 1215x1! +1377! 
+ 15444x? +46899x® +90153x7 + 133893x® + 125388x> +29160x* — 32076x3 
+ 26244x? — 8748x +2916. 

us, = (x1® + 4x12 — 16x1! + 80x? +2x® + 160x7 +128x® — 160x5 +28x* — 48x3 + 128x? 
— 16x + 1)(x1© + 4x!2 + 16x!1! — 80x? +2x® — 160x7 +128x® + 160x> +28x4 
+48x° +128x? + 16x +1). 

Ug = (x*+ 351x3+ 27x? — 31x + 1)? (6x5 + 251x3 — 372x?2+ 15x — 323)°. 

Ug = (x4 + 351x5 + 27x? — 31x + 19° (6x5 + 251x3 — 372x? + 15x — 323)°. 

Uy = (x3 + 75x? + 68x + 1)3(x? + 15x +35)°(7x? +750x + 137)? (x +75)! 

Ug = (x? + 75x? + 68x + 1)5(x? + 15x +35)!9(7x? + 750x + 137)!5(x +75)?°. 
[w2-Ug: from Wang-Trager!” } 

B.2 Data for multi-variate factorization 

$m_1 = (xy + z + 10)(xz + y + 30)(x + yz + 20)$.

m = (x3 (z+ y) +z — 11)(x2 (2? + y?) + y +90). 

Ms = (x2 y? +xz4+x+z)(x3+ xyz t+ y? + yz). 

mg = (x?z + y4*z? + 5)(xy? + 2? x3 y +2? + 3)c3 y* +27). 

ms = (3u? x3 y4z + xz? + y2z? + 19y?) (U2 y4z2 + x2z + 5)(u2 + x3 y4 + 27). 

me = (w*z® —w>x3 y — w2 x? y2z? +x52 — xt y? + y2z3)y(— x5z3 +x? y3 + yz) 
x (wixS y® — wtz? + w2x3 y + xy?2z?), 

mz=(xty+z—2)2 (x+y +z — 3). 

Me = (— 29! —wl?z2? +y!8 — y!4 + x2 y? 43:21 +? K— 15 p22) + 29w4*x!2 2? + 21x72? 


+ 3wi5y?0), 
My = u*xz?(18u2w3xz? + 10u2wxy? + 15uz? + 6w? y3z? )\(25u2wr yz? + 32u2 wt y4z3 
— 48u? x? y3z3 —2u2wx?y? + 44uwxytz* — 8uwx 24 +4w?x 
+ llw?xy + 12y3z?), 
M19 =(31lu?xz + 35w?y? +40wx? + 6xy)(42u2w2y? + 47u2w2z +22u?2w? + 9u2wx? 
+ 2lu?wxyz + 37u? y?z +u2w?xy?z? + 8u2w2z? +24u2wxy2z? +24u? x? yz? 
+ 12u?xy?z? + 13uw?x? y? +27uw?x?y + 39uwxz + 43ux2y + 44uw? z? + 37w2 xy 
+29w?y? +31w2 yz? +12wx? yz + 43wxyz? +22xy? +23xyz+24xy + 41y?z). 
m,, =(13u3w? xyz? — 4uxy? — wz? — 47xy)(— 36u?wi xyz? —31u2wiy? + 20u? w? x? y?z? 
— 36u? wxy?z + 46u2wx + 9u?y? — 36uw? y? + 9uwy? — Suwx? y? + 48uwx3 y?2z 
+23uwx? y? — 43ux3y3z3 — 46ux?y? + 29wexy3z? — 14w3x3y3z? —45x3 — 8xy?). 
My =(x+y +z —3)3. 
M13 =(2wz + 45x3 — 9y3 — y? + 327) (w2z3 — w? + 47 xy). 
Mig =C1Sxt yp" +41xty? =S 7x" + 26K y* +38 x*y* —29x7 4° —229" A3x* p* — 22x" 
+35x3y +11y?). 
M15 =x® y?z?(1l2w? xyz? — wz? +w? — 29x — 3xy?)(14w2y? +2wz + 18x3y — 8xy? 
—y? +323). 
[m,-m,5: from Wang!® ] 
eA 4 =e =F. 
[from Davenport”? ] 
py = (Ox? +0 + Ty + Qe tee. 
So = (p* + x?) +x? F (pt 4x)? 
53 =(ztytxtl)(z—ytxt4)(z —2y+x+7). 
Sq = (vy +x? + 5)(6y + 2x3 + 31)3 (xy? + By — x)5. 
Sg = (22 + xyz + x7)? (327 + (yp? +x)z — 4)3(z3 +z? + (x — ly)’. 
Se=(By4 +x +5)(6y? +2x +31)? (x? +x4+1)y? —p tx +8) (xy +8y +x)? 


[51-86 : from Wang-Trager!» ] 


B.3 Data for uni-variate factorization over algebraic extension fields 
a, =x* —x +3. 
@, =x* + 2, 
ix" =x + 1. 
Giga x° + 3x" + Gx* +x? — 3x7 + 12x + 16. 


a, =x? — 15x* — 87x? — 125, 
ag =x? — 54. 


a, xP 19. 


ag = x?+ x7. 
[a,-a, : from Abbott et al.'?)] 
Boron/Magnesium Sintered Composites 
for Computer Peripherals 


● Eiji Horikoshi ● Takehiko Sato (Manuscript received June 3, 1991)


A magnesium-based sintered composite, reinforced with boron particles, has been developed
that has an increased modulus of elasticity and a controllable thermal expansion coefficient.
A 6vol%boron/magnesium-9wt%aluminum composite has an excellent modulus of elasticity,
1.3 times that of the ordinary magnesium alloy, and a thermal expansion coefficient close to
that of aluminum alloy. This composite also has improved machinability when the size of the
boron particles is controlled. A head-arm component for a disk drive was experimentally
produced using this composite, and it was found that the composite greatly decreases the
thermal off-track error in the disk drive.


1. Introduction 

Downsizing is the strongest trend in com- 
ponent design today. 

Typical of computer peripherals illustrating 
this trend is the disk drive in which components, 
such as the head actuator arms and carriages, 
require lightweight, high-strength materials to 
meet needs for larger capacity and high-speed 
data access. 

In the disk drive, many components made
of aluminum (Al) alloy have been used because
of its lightweight, high-strength properties. Any
material that is lightweight and high-strength
and has a thermal expansion coefficient close to
that of Al alloy would therefore be suitable for
these components.

Magnesium-based (Mg-based) alloys have the 
greatest potential in this regard if their modulus 
of elasticity can be increased and if their thermal 
expansion coefficient can be brought close to 
that of Al alloy. 

Approaches to improving the modulus of
elasticity of these Mg-based alloys have used new
techniques such as squeeze and pressure casting.
These techniques are combined with composite
material processing using reinforcing materials
such as carbon fibers, silicon-carbide whiskers,
and alumina particles1),2). Fortunately, the
thermal expansion coefficient of these composite
materials can be controlled by varying the
amount of reinforcing materials.

But precise components with uniform alloy 
structure have not yet been fabricated using 
these composite materials. 

The sintering of near-net shapes produces
small precise components with a uniform alloy
structure. Very little work, however, has been
done on Mg-based alloys. To develop a Mg-based
alloy with an improved modulus of elasticity
in the form of a precision component, the
authors combined a composite material technique
with sintering.

Since Al has a large solid solubility in Mg,
a Mg-Al alloy matrix was selected to obtain
solid-solution hardening. Boron particles were
used for reinforcement because boron does not
react with Mg.

Generally, composite materials show less
machinability because they contain a large number
of whiskers, fibers or particles which are
very hard. Despite progress in powder metallurgy,
it is not yet possible to produce certain
geometrical features, such as transverse holes
and undercuts, which thus frequently necessitate
machining, particularly drilling.

Against this background, the authors developed
the boron/Mg-Al sintered composite having
an increased modulus of elasticity, controllable
thermal expansion coefficient and excellent
machinability. Then the composite was evaluated
as a material for a head-arm component.


2. Experiment 

A Mg-6wt%Al alloy was used in evaluating
the sintering atmosphere for Mg-Al alloys.
Boron particles were then used to reinforce the
Mg-6wt%Al sintered alloy. The relationship
between boron content and the modulus of
elasticity was studied. To determine the dependence
of the modulus of elasticity on the Al
content, the Al content of the boron/Mg sintered
composite system was varied. The dependence
of the thermal expansion coefficient on the
boron content was then measured.

The following sintering experiments were 
performed. 

Table 1 shows the specifications for the
starting powders: ground Mg powder (average
particle size: 50 μm), gas-atomized Al powder
(average particle size: 30 μm), and crushed or
reduced boron. To study the effect of the boron
particle size on machinability, the average particle
size of boron was varied from 1 μm to 15 μm.

In mixing, special attention was paid to
obtaining a uniform mixture of coarse Mg powder
and fine boron powder. Powders were mixed for
two hours in a V-shell blender or a grinding
mixer with ethyl alcohol. To investigate the
optimum composition and the thermal expansion
coefficient, the boron content was varied between
0 vol% and 20 vol%. Then the Al content
was varied between 0 wt% and 18 wt% to determine
the composition with the highest modulus
of elasticity.

Mixtures were compacted into a 97 mm
× 6 mm × 4 mm tensile test piece under a pressure
of 400 MPa.

Mg-6wt%Al samples were sintered at 560 °C
to 620 °C for one hour in an argon, hydrogen,
or nitrogen atmosphere. Samples in which the
ratios of boron and aluminum were varied were
sintered under the conditions optimized for the
Mg-6wt%Al alloy.

Table 1. Starting powder specification

Powder | Purity (%) | Average particle size (μm) | Fabrication
Mg     | 99.9       | 50                         | Ground
Al     | 99.9       | 30                         | Gas-atomized
Boron  | 99.8       | 15                         | Crushed
Boron  | 99.7       | 10                         | Crushed
Boron  | 99.5       |  5                         | Crushed
Boron  | 95.7       |  1                         | Reduced

The mechanical properties were measured 
with an Instron Universal Test Machine at 
a cross-head speed of 1 mm/min. 

Optical microscopy and XMA analysis were 
then used to evaluate the microstructures. 

The machinability indices were determined 
by measuring the tool wear, cutting resistance, 
and cut chips during machining. Tool wear was 
determined by the outer corner wear of the 
cemented-carbide drill bits, 2 mm in dia, after 
three holes were drilled in a test piece 6 mm 
thick. Outer corner wear was measured as the 
difference between the virgin edge and the worn 
edge on the corner of the drill bit. The cutting 
resistance was measured by an oblique cutting 
test with sintered diamond cutting tools. The 
oblique cutting test was done with a lathe at 
a cutting speed of 200 m/min, a cutting depth of 
0.5 mm, and a cutting feed of 0.1 mm/rev. Worn 
drill bits and cut chips were evaluated using SEM. 

The thermal off-track error is thought to be
caused by the difference in the thermal expansion
coefficients between the disk material
(aluminum) and the head-arm material (magnesium).
To measure the improvement in thermal
off-track error, a new head-arm was made using
6vol%boron/Mg-9wt%Al, which has a thermal
expansion coefficient close to that of aluminum.
The arm size is 57 mm × 24 mm × 4 mm. It has
six small tapped holes 0.8 mm in dia.
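
The mechanism described above can be put into numbers with a first-order estimate: the relative displacement is the mismatch in expansion coefficients times a characteristic length times the temperature rise. The Python sketch below is illustrative only. The value 24 × 10⁻⁶/°C for the aluminum side is the figure quoted in this paper, while the expansion coefficient of the Mg alloy (26 × 10⁻⁶/°C), the 30 mm length, and the 20 °C temperature rise are assumptions chosen for the example, not measured values from this work.

```python
def thermal_offtrack_um(alpha_arm, alpha_disk, length_mm, delta_t):
    """First-order off-track estimate in micrometres.

    alpha_arm, alpha_disk: thermal expansion coefficients (1/degC)
    length_mm: characteristic length over which the mismatch acts (mm)
    delta_t: temperature rise common to arm and disk (degC)
    """
    return (alpha_arm - alpha_disk) * length_mm * 1e3 * delta_t

# Assumed values for illustration only (see the text above).
ALPHA_DISK = 24e-6       # Al alloy, as quoted in this paper
ALPHA_MG_ALLOY = 26e-6   # assumed value for a conventional Mg alloy arm

print(thermal_offtrack_um(ALPHA_MG_ALLOY, ALPHA_DISK, 30.0, 20.0))  # about 1.2
print(thermal_offtrack_um(ALPHA_DISK, ALPHA_DISK, 30.0, 20.0))      # matched: 0.0
```

Under these assumptions a mismatch of only 2 × 10⁻⁶/°C already produces an error on the order of a micrometre, which is why matching the arm to the disk material matters.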


3. Results and discussion 
3.1 Sintering conditions 

Figure 1 shows the relationship between the
sintering temperature and the modulus of elasticity
of the Mg-6wt%Al alloy sintered for one
hour in an argon, hydrogen, or nitrogen atmosphere.
The maximum values for the modulus of
elasticity are different for each atmosphere. The
sample sintered at 600 °C for one hour in argon
has the highest modulus of elasticity, 39 GPa.
When the sample is sintered above 600 °C, the
microstructure becomes coarse and the modulus
of elasticity becomes lower. When the sample is
sintered below 600 °C, the densification is
insufficient. Consequently, the modulus of
elasticity becomes lower.

Fig. 1—Relationship between sintering temperature and
the modulus of elasticity of the Mg-6wt%Al alloy
sintered in an argon, hydrogen, or nitrogen
atmosphere. (Axes: sintering temperature, 560 °C to
620 °C; modulus of elasticity in GPa.)

The sample sintered in hydrogen has a maximum
modulus of elasticity at a temperature
20 °C lower than the sample sintered in argon or
nitrogen. Hydrogen makes the Mg-6wt%Al
microstructure too coarse because the powders
become more active in hydrogen. This is why
the sample sintered in hydrogen has a lower
modulus of elasticity than the sample sintered in
argon.

In the microstructure of the sample sintered
in nitrogen, the formation of magnesium nitride
(Mg3N2) is observed at the grain boundaries.
Mg3N2 is well known to be brittle. Thus, the
modulus of elasticity is less than that of the
sample sintered in argon.

Fig. 2—Relationship between the modulus of elasticity
and boron content in the Mg-6wt%Al alloy.

Fig. 3—Relationship between density and boron content.


Sintering in argon at 600°C is therefore 
optimum for the Mg-6wt%Al alloy, since no 
brittle phases are formed and grain growth and 
densification are balanced. 


3.2 Optimum composition and thermal expansion coefficient

3.2.1 Boron content 

Figure 2 shows the relationship between the
modulus of elasticity and boron content in the
Mg-6wt%Al alloy sintered in argon at 600 °C for
one hour. The average particle size of the boron
is 15 μm. The modulus of elasticity increased
with the addition of boron, probably because
boron particles reinforce the matrix at
compositions of up to 10% by volume. When the boron
content increases above this, the modulus of
elasticity decreases. The reason is as follows.

The green and sintered densities are shown 
in Fig. 3 as a function of the boron content. The 
green density decreases abruptly when the boron 
content exceeds 10% by volume because boron 
particles tend to reduce the compactibility of 
the green sample. Hard boron particles reduce 
the smooth movement of the compacting dies. 
Correspondingly, the sintered density decreases, 
which results in reducing the modulus of elastic- 
ity. 

3.2.2 Aluminum content 

The dependence of the modulus of elasticity
on Al content is shown in Fig. 4. The modulus
of elasticity has a maximum value of 63 GPa,
1.4 times that of the cast Mg-Al
alloy without boron, when the Al content is
9% by weight.

A typical microstructure of the composite, 
10vol%boron/Mg-9wt%Al, exhibiting a maxi- 
mum modulus value, is shown in Fig. 5. 

Figure 6 shows the result of XMA analysis
for the samples containing 6% and 9% Al by
weight, and 10% boron by volume. The X-ray
was scanned across boron particles. Both samples
have a uniform distribution of Al and Mg in
the matrix. However, the sample containing 9%
Al by weight, the composite with the highest
modulus of elasticity in this study, has an Al-rich
layer several microns thick around the
boron particles. This concentration may promote
the adhesion of boron and Mg, resulting in
a boron/Mg-Al composite with a high modulus
of elasticity.

Fig. 4—Dependence of the modulus of elasticity on the
aluminum content.

Fig. 5—Typical microstructure of the
10vol%boron/Mg-9wt%Al composite.

Fig. 6—Results of XMA analysis for two samples:
a) 6 wt% Al, b) 9 wt% Al. The X-ray was scanned
across boron particles.

Fig. 7—Dependence of the thermal expansion coefficient
on the boron content.

Fig. 8—SEM photographs of worn bits and optical photographs of holes.

3.2.3 Thermal expansion coefficient

The dependence of the thermal expansion
coefficient on the boron content for the sample
with 9% Al by weight is shown in Fig. 7. The
thermal expansion coefficient decreases linearly
with increasing boron content, indicating that
the thermal expansion coefficient can also be
controlled by the boron content. This provides
more flexibility in how the alloy can be used.
For example, the thermal expansion coefficient
of a 6vol%boron/Mg-9wt%Al composite is close
to that of the Al alloy ($24 \times 10^{-6}$/°C).


3.3 Effect of the boron particle size on machinability
3.3.1 Tool wear
Perhaps the most meaningful measure of
machinability is tool wear.

Fig. 9—Relationship between boron particle size and
outer corner wear.

Figure 8 shows bits before and after three
holes were drilled in the test piece, together with
the holes through which bits exited. The cutting
edge of the bit used for drilling the 1-μm boron
sample had little damage, but that for drilling
the 5-μm boron sample was heavily damaged,
especially at the outer corners. The hole features
reflect these facts. The test piece with 5-μm
boron particles shows chips around the hole, but
the test piece with 1-μm boron particles shows
no chips around the hole.

Figure 9 summarizes the outer corner wear
as a function of boron particle size at different
drilling speeds. Fine boron particles improve
machinability.

The tool life is increased by over 30 times by
using 1-μm boron particles instead of 5-μm
particles.

3.3.2 Cutting resistance 

The cutting resistance during machining can
also be used to predict machinability. Low cutting
resistance generally means low tool wear
and good machinability.

Figure 10 shows the cutting resistances in an
oblique-cutting test for samples with different
boron particle sizes. The Fp curve crosses the Fy
curve when the boron particle size exceeds 5 μm.
This may be caused by the larger boron particles 
tending to remain not in the cut chip but in the 
workpiece. The cutting resistance F, decreases 
with boron particle size, in agreement with the 
machinability obtained from the tool wear. 

3.3.3 Cut chips 

The sample with 1-μm boron particles gives
long curled chips, a sign of good machinability,
as shown in Fig. 11. Chips of the sample with
5-μm boron particles are cracked and broken.

Figure 12 shows typical microstructures for
sintered composites. In sample a), the fine boron
particles (1 μm) collect together to form a
network-like pattern in the grain boundaries, but
individual particles are microscopically isolated
in the sample with 6 vol% boron.

As shown in sample b) of Fig. 12, the larger
boron particles (5 μm) are dispersed at random
in the alloy matrix. In b) of Fig. 13, note how
even one large boron particle can occupy a large
part of the chip. The weak interface between the
boron and the alloy matrix may cause cracks
under the bending stress caused by the cutting
tool. The chip is then easily broken. This tendency
increases as the ratio of cutting depth to
boron particle size decreases. A small ratio
causes boron particles to fall off and increases
tool wear. Fine boron particles, such as those in
sample a) of Fig. 13, do not cause this problem.

Fig. 10—Relationship between the cutting resistance and
boron particle size. (Axes: boron particle size in μm,
cutting resistance in N. Cutting conditions: depth 0.5 mm,
feed 0.1 mm/rev, speed 200 m/min.)

Fig. 11—Cut chips.

Fig. 12—Optical microstructures of the 6vol%boron/Mg-9wt%Al sintered composites.
a) 1-μm boron, b) 5-μm boron.

Fig. 13—Cutting chips. a) 1-μm boron, b) 5-μm boron.

Fig. 14—Disk drive head actuator arm.


3.4 Application to a disk drive
To ascertain the improvement in machinability
and the controllable thermal expansion coefficient,
a head-arm for a disk drive was made
using the new composite material, and the
thermal off-track error was then measured.

Fig. 15—Schematic drawing of measuring thermal
off-track.

Figure 14 shows a head actuator arm. An
experimental arrangement for measuring thermal
off-track is shown in Fig. 15. This head-arm has two
read-write heads on one side and a total of four
heads on both sides. Four head-arms with 16
heads were assembled in a disk enclosure. The
distance between outer and inner heads is
21 mm.

Fig. 16—Sample measurement of off-track error.

Figure 16a) shows a typical example of off-track
measurement. As a reference, off-track
data of a conventional head-arm component of
the Mg AZ91 alloy (Mg-9wt%Al-1wt%Zn) is
shown in Fig. 16b). The vertical fourth position
of the outer head is the servo head used as the
reference position. In the five upper head positions,
off-track values with absolute magnitudes of 1 μm to
1.5 μm are observed for the Mg AZ91 alloy. In
the 6vol%boron/Mg-9wt%Al composite, the
absolute value of the off-track decreased to
0.5 μm. Clearly, the off-track error decreased to
1/2 to 1/3 of its former value by using the composite.
The off-track value does not become
zero, even though the arm material has the same
thermal expansion coefficient as the disk material,
possibly because the temperature in the head-arm is
slightly lower than that in the disk. The off-track
errors of the two lower positions (1 and 2) are
larger than those of the upper positions. The
larger off-track is thought to be caused by the
thermal deformation in the mechanical structure.


4. Conclusion 
The authors developed a boron/Mg-Al
sintered composite and applied it to the head-arm
component of a disk drive.
The results can be summarized as follows:

1) A new boron/Mg-Al composite that exhibits
   an increased modulus of elasticity was
   developed by sintering. A maximum modulus
   of elasticity of 63 GPa is achieved in
   a 10vol%boron/Mg-9wt%Al composite. This
   modulus of elasticity is 1.4 times that
   of the cast Mg-Al alloy.

2) The thermal expansion coefficient of the
   boron/Mg-Al system decreases linearly with
   increasing boron content. When the sample
   contains 6 vol% boron, the thermal expansion
   of the composite is close to that of Al.

3) The larger the boron particles, the greater
   the tool wear. Tool life is increased over 30
   times using material containing 1-μm boron
   particles instead of 5-μm particles, for
   example.

4) When the head-arm component is made of
   this sintered 6vol%boron/Mg-9wt%Al composite,
   the thermal off-track error is reduced
   to 1/2 to 1/3 of the error found in head-arms
   made of conventional materials.
The need for a fast, miniaturized, optical switch with low insertion loss has increased with the
proliferation of optical communications and optical signal processing. This paper describes
a moving-fiber optical switch consisting of a LiNbO3 piezoelectric actuator having an
antipolarized ferroelectric laminar domain (inversion layer). A new construction technique
simplifies optical axis alignment and makes fiber end-face polishing unnecessary. A 2 × 2
moving-fiber optical switch constructed using the above techniques has an insertion loss
within 0.8 dB and a switching time less than 10 ms.


1. Introduction 

The need for a fast, compact, and inex- 
pensive switch with low insertion loss has 
increased with the accelerating growth of optical 
communications and signal processing. Although 
most optical switches use mechanical com- 
ponents because of their low loss and optical 
mode independency, these components require 
precise assembly and adjustment, and are 
difficult to mass produce. 

There are three types of mechanical switches: 
switches that use moving prisms, switches 
that use moving mirrors, and switches that 
move the fibers. Moving-fiber switches have
the following advantages:

1) Simple structure and low component count 
2) Low insertion loss and fast switching speed 
However, moving-fiber switches require precise 
fiber alignment. 

To minimize size and cost, we derived the 
acceptable positioning margins from the optical 
losses caused by the following: the Fresnel 
reflection at the fiber end-faces, transverse offset, 
angular misalignment, longitudinal separation of 
the fiber ends, and incomplete fiber end-faces. 
For the new switch we chose a fifty-micron
diameter graded index (GI) optical multimode
fiber and an LED light source with a wavelength
of 1.3 μm.

We developed a new switch structure that
secures the fibers in V-shaped grooves aniso- 
tropically etched in silicon and a method of 
precisely mounting the fibers using a water- 
soluble resin. Array precision is obtained by 
cutting fibers using a dicing saw. This produces 
smooth fiber end-faces without the need for 
polishing. 

A piezoelectric bimorph actuator compares
favorably with electromagnetic actuators in
terms of size and speed. However, conventional
piezoelectric ceramic actuators have the
disadvantages of hysteresis and creep (displacement
during change-over) due to adhesion of the two
ceramic plates. The LiNbO3 piezoelectric
actuator having an antipolarized ferroelectric
laminar domain (inversion layer) proposed
by Shimizu et al.1),2) has no adhesion layer and
executes fast and precise fiber movement.

We developed a 2 x 2 moving-fiber optical 
switch using multimode fiber. This switch 
features a switching time of 10 ms or less, and 
a loss of 0.8 dB or less. 


2. Optical switching losses 

Fresnel reflection at the fiber end-faces,
transverse offset, angular misalignment,
longitudinal separation of the fiber ends, and
incomplete fiber end-faces contribute to the
overall coupling loss in fiber splices. Figures 1
and 2 show the optical coupling losses in 50 μm
diameter GI fiber due to fiber end-separation
and transverse offset, respectively. An LED
emitting at 1.3 μm was used for the measurement.
Using an optical fiber mode scrambler
allows only traveling modes to pass through the
fiber. Figure 1 shows the optical coupling
loss due to the spacing between the fiber end-faces
in the changeover section. A fiber separation
of 40 μm increases the loss by 0.1 dB.
Figure 2 shows the optical coupling loss due to
transverse offset for two values of fiber end-separation.
Table 1 shows the estimations of the
optical coupling losses including the Fresnel
reflection loss at the fiber end-faces. For an
optical switch, losses due to angular misalignment,
roughness, and contamination of end-faces
must be added.

Fig. 1—Optical coupling loss due to fiber end-separation.
(GI fiber, core 50 μm. Axes: separation d in μm, loss in dB.)

Fig. 2—Optical coupling loss due to transverse offset.
(GI fiber, core 50 μm. Axes: offset X in μm, loss in dB.)

Table 1. Estimated losses of moving-fiber optical switch

Loss factor                            | Loss    | Condition
Fresnel reflection at fiber end-faces  | 0.31 dB | n = 1.5
Fiber end-separation                   | 0.1 dB  | Gap: 40 μm
Transverse offset                      | 0.1 dB  | Offset: 5 μm

Fig. 3—Optical travelling loss due to fiber bending.

Fig. 4—Optical switching circuit.

The relationship between fiber bending radii 
and traveling losses is important factors when 
miniaturizing a moving fiber switch. Figure 3 
shows the experimental losses for traveling 
modes. 
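
The Fresnel entry in Table 1 can be cross-checked with a short calculation. The sketch below is only an illustration: it assumes normal incidence, an air gap, and one glass-air interface per fiber end-face (two in total), and it ignores interference between the two faces. The function name and the comparison index of 1.46 are assumptions introduced here, not values from the measurements. Under these assumptions, n = 1.5 gives roughly 0.35 dB, slightly above the 0.31 dB listed, while an index near 1.46 gives roughly 0.31 dB.

```python
import math

def fresnel_gap_loss_db(n_fiber: float, n_gap: float = 1.0, faces: int = 2) -> float:
    """Loss in dB from Fresnel reflection at `faces` fiber/gap interfaces (normal incidence)."""
    # Power reflectance of a single interface at normal incidence.
    r = ((n_fiber - n_gap) / (n_fiber + n_gap)) ** 2
    # Each interface transmits (1 - r); convert the total transmission to dB.
    return -10.0 * math.log10((1.0 - r) ** faces)

loss_15 = fresnel_gap_loss_db(1.5)    # index quoted in Table 1
loss_146 = fresnel_gap_loss_db(1.46)  # assumed comparison value, not from the paper
print(f"n = 1.50: {loss_15:.2f} dB")
print(f"n = 1.46: {loss_146:.2f} dB")
```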


3. Structure of optical switch 

Figure 4 shows the switching circuits for an 
experimental 2x2 optical switch. When the 
switch is off, fiber 1 is connected to fiber 4 and 
fiber 2 is connected to fiber 3. When the switch 
is on, fiber 1 is connected to fiber 3 and fiber 2 
is connected to fiber 4. The loss in the optical 
path between fibers 2 and 3 is greater than in 
the other paths because light passes through the 
changeover section twice. 
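
The two switch states described above can be written as a small lookup table. This is a minimal sketch of the routing logic only; the names `ROUTES` and `connected_fiber` are illustrative and do not come from the original work.

```python
# Fiber-to-fiber connections as described for the circuit of Fig. 4.
ROUTES = {
    False: {1: 4, 2: 3},  # switch off: fiber 1 -> 4, fiber 2 -> 3
    True: {1: 3, 2: 4},   # switch on:  fiber 1 -> 3, fiber 2 -> 4
}

def connected_fiber(switch_on: bool, input_fiber: int) -> int:
    """Return the fiber that input_fiber (1 or 2) is connected to in the given state."""
    return ROUTES[switch_on][input_fiber]

for state in (False, True):
    label = "on" if state else "off"
    for fiber in (1, 2):
        print(f"switch {label}: fiber {fiber} -> fiber {connected_fiber(state, fiber)}")
```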

Figure 5 shows the structure of the optical
switch. One fiber array is secured to a stationary
table, while the other is mounted on top of the
bimorph actuator. Switching is accomplished by
a 170 um lateral displacement of the actuator.

Fig. 5—Structure of the optical switch.
(Labels: GI fibers; actuator (LiNbO3); stationary table (LiNbO3).)

Fig. 6—Principle of piezoelectric LiNbO3 actuator having
oppositely-polarized layers.
(Labels: holder; ferroelectric polarity; displacement u; expansion layer; compression layer; LiNbO3; electrode.)

A piezoelectric bimorph (see Fig. 6) was 
selected for the actuator because of its smaller 
size and higher speed relative to solenoid-type 
actuators. A voltage applied across the two 
oppositely polarized layers of the actuator’s 
piezoelectric plate causes one layer to expand 
and the other to contract. These movements 
bend the plate and reposition the movable array 
relative to the stationary array. The two fiber 
arrays are separated by a 40 um gap and their 
optical axes are aligned. The fibers are arranged 
170 um apart in V-shaped grooves that are 
anisotropically etched in a silicon substrate. 
The fibers are 50 um core GI fiber. 


4. Piezoelectric bimorph actuator!”**?

A piezoelectric bimorph actuator consisting 
of a reverse-polarized LiNbO3 crystal plate was
developed to meet the requirements of precision
positioning, simplified structure, and faster 
switching. LiNbO3 crystal produces oppositely 
polarized layers when heated slightly below 
the Curie temperature. Unlike other types of 
ceramic piezoelectric bimorphs, the LiNbO3 
bimorph can be fabricated on a single plate. 


4.1 Reverse-polarized LiNbO3 crystal 

An antipolar ferroelectric laminar domain 
(inversion layer) can be obtained from single 
LiNbO3 crystals by Ti diffusion or by heat 
treatment at a temperature slightly below 
the Curie point. These polarization inversion 
phenomena occur without an external electrical 
field. 

Piezoelectric devices based on such an inver- 
sion layer include resonators, filters, ultrasonic 
transducers, and actuators. Because the polarity 
of ferroelectricity in a piezoelectric plate defines 
the direction of the strain caused by an electric 
field, applying a voltage to both sides of the 
LiNbO3 plate causes strain in opposite directions
in the inversion and noninversion domains,
and therefore produces mechanical displace- 
ment. 

Conventional piezoelectric actuators consist 
of two piezoelectric plates of opposite polarity 
cemented together. These actuators are difficult 
to fabricate and are subject to irregularity due to 
variations in characteristics such as hysteresis 
and creep. Actuators based on LiNbO3 with an
inversion layer are better suited to volume pro- 
duction because they can be manufactured by 
heat treatment without cementing, and have no 
hysteresis or creep. 

To make the new actuator, an inversion
layer was formed on 140°-rotated Y-cut
LiNbO3 with a maximum piezoelectric strain
constant of d23 ($d_{23} = -30 \times 10^{-12}$ C/N). This
was achieved by heating to 1155 °C for 24
hours in a mixture of oxygen and nitrogen after
Ti diffusion. The resulting crystal was cut
parallel to the X-axis, then etched in a mixture
of boiling HNO3 and HF. In the crystal cross
section, the inversion layer extends to the center
of the plate. Figure 7 shows the cross section
of 140°-rotated Y-cut LiNbO3.

Fig. 7—Cross section of a 200-um thick 140°-rotated
Y-cut LiNbO3 plate with inversion layer.

Fig. 8—Bending vibration of 140°-rotated Y-cut LiNbO3
plate with inversion layer.
(Axes: Frequency (kHz), 2.5 to 3.5; Admittance (dB).)


To evaluate the material, the bending vibration
response was measured for a LiNbO3 plate
cut from a wafer with electrode layers deposited
on both sides. The existence of the inversion
layer was confirmed by measuring the capacitance
ratio C_d/C of bending vibration and by
comparing it with the theoretical value. The
capacitance ratio was determined from the
resonant frequency F_r and the antiresonant
frequency F_a as shown in Equation (1).

$$ C_d/C = \frac{F_r^2}{F_a^2 - F_r^2} \qquad (1) $$

F_r and F_a were determined from the admittance
or impedance.

Fig. 9—Structure of piezoelectric actuator.

If the inversion layer extends to the center
of the plate, the value of C_d/C should agree with
the theoretical value. Figure 8 shows the bending
vibration response of a 140°-rotated Y-cut
LiNbO3 plate. The measured C_d/C was 7.37,
somewhat higher than the theoretical value of 
6.42. 
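
The capacitance ratio can be evaluated numerically once the two frequencies are read from the admittance curve. The sketch below takes Equation (1) in its usual form, ratio = F_r^2 / (F_a^2 - F_r^2), which is an assumed reading of the printed equation. The 3.0 kHz resonant frequency is a hypothetical value chosen inside the range plotted in Fig. 8, not a measured one, and both function names are illustrative. The code inverts the relation to show how far apart the resonance and antiresonance would have to be for the measured (7.37) and theoretical (6.42) ratios.

```python
def capacitance_ratio(f_r: float, f_a: float) -> float:
    """Capacitance ratio from resonant (f_r) and antiresonant (f_a) frequencies."""
    if f_a <= f_r:
        raise ValueError("antiresonant frequency must exceed resonant frequency")
    return f_r**2 / (f_a**2 - f_r**2)

def antiresonance_for_ratio(f_r: float, ratio: float) -> float:
    """Invert the relation: antiresonant frequency implied by a given ratio."""
    return f_r * (1.0 + 1.0 / ratio) ** 0.5

f_r = 3000.0  # Hz, hypothetical value for illustration only
for ratio in (7.37, 6.42):  # measured and theoretical ratios quoted in the text
    f_a = antiresonance_for_ratio(f_r, ratio)
    print(f"ratio {ratio}: f_a = {f_a:.0f} Hz, check = {capacitance_ratio(f_r, f_a):.2f}")
```

A smaller ratio corresponds to a wider spacing between the two frequencies, so the measured value of 7.37 implies a slightly narrower spacing than the theoretical value would.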


4.2 Bending actuator structure 

Figure 9 shows the basic structure of the
actuator. The actuator consists of a LiNbO3
plate that flexes when a voltage is applied
across it. Three GI fibers are mounted on the
top of the plate so that their alignment with
three corresponding fibers on a stationary table
changes when the plate is flexed. Adjacent fibers
are secured 170 um apart in V-shaped grooves
anisotropically etched in Si. This spacing means
that the optical path can be switched by a 170-um
displacement of the LiNbO3 plate.

The gap between opposing fiber arrays is
40 um. The optical array weighs about 10 mg.
Stoppers on both sides of the plate ensure that
the plate moves by the correct amount.
These stoppers also dampen plate vibration and
protect against external vibration.


4.3 Bending displacement 

A 26 x 5 x 0.2 mm plate was cut from a
LiNbO3 wafer on which Au/NiCr electrode
layers were deposited on both sides. One millimeter
of one end was then inserted in a plated
metallic holder and soldered in place. Leads
were bonded to both sides of the plate.
A 2 x 3 mm reflector plate weighing
about 3 mg was bonded to the top of the
actuator to enable displacement measurement
using a laser-based displacement gauge. A 140°-rotated
Y-cut LiNbO3 plate was used for the
actuator. Figure 10 shows that the measurements
agree well with the values calculated from Equation (2).
No hysteresis was observed.


$$ u = \frac{3}{2}\,|d_{23}|\left(\frac{l}{t}\right)^{2} V \qquad (2) $$

where

u: Displacement
d23: Piezoelectric strain constant
V: Applied voltage
t: Thickness of LiNbO3 plate
l: Length of LiNbO3 plate.

Fig. 10—Displacement vs. applied voltage for
140°-rotated Y-cut LiNbO3 actuator.

Fig. 11—Displacements of two types of actuators.

A voltage of 220 V was applied, and the
creep in actuator displacement was monitored
for 30 min. Two types of actuators were made.
The LiNbO3 plate of the first type was soldered
to the base, and that of the other was fixed
using resin. As can be seen in Figure 11, creep in
the soldered actuator did not exceed 0.2 um,
but was significant in the resin-fixed actuator.

Both actuators produced displacements with 
good linearity for the applied voltage. The 
actuators were then modified for a displacement 
of 170 um. 
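
As a rough numerical check of Equation (2), read here as u = (3/2) |d23| (l/t)^2 V, the sketch below evaluates the free displacement for the plate described in this section. It assumes a free length of 25 mm (the 26 mm plate minus the 1 mm held in the holder) and ignores the reflector mass, stoppers, and any load; the function names are illustrative. Under these assumptions the formula gives roughly 155 um at 220 V. The result is only indicative and is not a reproduction of the measured data in Fig. 10.

```python
D23 = 30e-12         # |d23| in C/N (equivalently m/V), from Section 4.1
THICKNESS = 0.2e-3   # plate thickness t in metres
FREE_LENGTH = 25e-3  # assumed free length l in metres (26 mm plate, 1 mm clamped)

def tip_displacement(voltage: float, length: float = FREE_LENGTH,
                     thickness: float = THICKNESS, d23: float = D23) -> float:
    """Free tip displacement in metres: u = (3/2) * |d23| * (l/t)^2 * V."""
    return 1.5 * d23 * (length / thickness) ** 2 * voltage

def voltage_for(displacement: float) -> float:
    """Voltage predicted for a given free displacement (the relation is linear in V)."""
    return displacement / tip_displacement(1.0)

print(f"u at 220 V: {tip_displacement(220.0) * 1e6:.1f} um")
print(f"V for 170 um: {voltage_for(170e-6):.0f} V")
```

Because the displacement scales with (l/t)^2, small changes in free length or thickness shift the required voltage noticeably, which is one reason the clamped length matters in this kind of estimate.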


4.4 Damping of actuator response

Optical LAN applications require that the loss be
within 1.5 dB, that the switching time be within
10 ms, and that vibration after application of a
step voltage fall to within 4 um in 10 ms. The
primary response to a high step voltage without
stoppers or damping is shown in Fig. 12a).
This vibration stops after about 200 ms, which
is too long for an optical switch actuator. We
then placed stoppers on both sides of the
LiNbO3 plate as shown in Fig. 9 and applied
a voltage sufficient for a 170-um displacement
and a 450 mg stress between the plate and
stoppers. Without stoppers, this voltage would
produce a 250-um displacement. The stoppers
and the additional stress protect against external
vibration. Figure 12b) shows the response
of the actuator with stoppers. The impact
between the LiNbO3 plate and the stoppers
produces chatter composed mainly of secondary
resonances. The vibration fell to 4 um or less
only after 30 ms, which exceeds the 10 ms
requirement. Therefore, an additional load of
soft resin was bonded to the center of the
LiNbO3 plate to suppress chatter (see Fig. 9).
Figure 13 shows the relationship between the
load weight and chatter damping time. A load
of 40 mg or more damped chatter within 10 ms.
Figure 12c) shows how a 45 mg load damps
actuator chatter to within the required limits.

Fig. 12—Actuator responses to a step voltage.
(Panel c: With stoppers and load.)

Fig. 13—Chatter in LiNbO3 plate damped
by various loads.


4.5 Actuator specifications 


A stress of 2 g or more is needed to
move three fibers at the required speed.
Therefore, the required actuator dimensions are
26 x 5 x 0.2 mm. Plate vibration is generated
when the plate hits the stopper, resulting in
fiber-axis deviation. Because the loss due to
vibration must not exceed 0.3 dB, the axis
deviation must not exceed 4 um (see Figs. 1 and
2). Therefore, plate vibration due to a step
voltage must be damped to 4 um or less within
10 ms (see Table 2).

Table 2. Optical switch actuator specifications

Item                 Specification
Displacement         > 170 um
Switching time       < 10 ms
Position accuracy    < 4 um
Produced stress      > 2 g
Drive voltage        500 V
Drive current        0.5 nA

Fig. 14—Actuator drive circuit. (Scale bar: 10 mm.)


4.6 Electrical drive circuit 

To move the fibers while keeping chatter within
acceptable limits, this piezoelectric actuator
requires at least 500 V but draws no more than 0.5 nA.

This drive voltage can easily be supplied by 
the miniature inverter and small step-up trans- 
former shown in Fig. 14. The input voltage of 
this circuit is 5 V, and the input current is 
100 mA. The circuit measures 31 x 20 x 5.3 mm, 
and easily fits into the optical switch enclosure. 


5. Optical switching path 


Important requirements for an optical
switch include precise spacing of the fibers on
the actuator and stationary table, smooth
fiber end-faces, and precise alignment of the
moving fibers with respect to the stationary
fibers.

For precise spacing, the fibers were arranged 
in V-shaped grooves anisotropically etched in 
silicon. For smooth fiber end-faces the fibers 
were precision-cut using a dicing saw. For precise 
alignment, the fibers were mounted using a 
water-soluble resin. 


5.1 Construction 

Figure 15 shows how the optical path is
constructed. First, the optical fibers are placed
in V-shaped grooves in a silicon crystal plate to
form the fiber array. Second,
the fiber array is secured to the transfer substrate
using a water-soluble resin. Third, the
fiber array is cut with a dicing saw to produce
the changeover section. (The array is cut using a
dicing saw so that the end-faces are smooth and
flat.) Fourth, the array, still attached to the
transfer substrate, is secured to the holder
with resin. When the resin is cured, the array is
immersed in water to dissolve the resin securing
the transfer substrate so that the transfer substrate
can be removed. (This takes up to two
hours.) Fifth, the array is transferred to the
holder and the assembly is dried at room temperature.
The main optical paths are therefore
constructed without the need for adjustment or
fiber end-face polishing.

Fig. 15—Constructing the optical switching path.
1) Constructing the fiber array (silicon plate with grooves)
2) Securing the transfer substrate (water-soluble resin)
3) Forming the fiber changeover end-faces
4) Securing the array to the actuator and stationary table
5) Removing the transfer substrate

The fiber array stoppers are trimmed for the 
correct fiber movement after the switch is 
assembled. 


5.2 Forming the V-shaped grooves 

Three V-shaped grooves, each 150 um wide, are
formed at intervals of 170 um. Figure 16 shows
the V-shaped grooves. A (100)-oriented silicon
wafer is etched with an anisotropic etching
solution in the <001> direction.

The anisotropic etching solution consists of 


ethylene diamine, pyrocatechol, and water?’ ® 


Fig. 16—V-shaped grooves.
(Labels: (100) plane; 150 um groove width; 170 um pitch; silicon crystal.)

Fig. 17—Silicon wafer etched V-shaped groove (SEM).
(Scale: 170 um.)

Etching is performed at 118°C for about
three hours. During etching, nitrogen is used 
to bubble the etching solution to prevent 
oxidization, and the solution is stirred to keep 
the composition uniform. Buffered hydro- 
fluoric acid is used for oxide film etching, and 
hot concentrated sulfuric acid is used for resist 
removal. 

If the composition of the etching solution is 
incorrect or changes during etching, small, 
quadrangular, pyramid-like projections (micro- 
pyramids) develop on the etched surface, thus 
preventing the formation of smooth grooves.) 
Figure 17 shows successfully formed V-shaped 
grooves as seen under a scanning electron 
microscope. Using the above technique, V- 
shaped grooves can be formed with an arrange- 
ment precision of 2 um or better. 
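
The groove dimensions can be related to the seated position of a fiber with simple geometry. The sketch below rests on two assumptions that are not stated in the text: the groove is etched to a complete V bounded by {111} sidewalls, which meet the (100) surface at arctan(sqrt(2)), about 54.74 degrees, and the fiber has a 125 um cladding diameter (a common value for 50 um core GI fiber). The function names are illustrative. Under these assumptions a 150 um wide groove is about 106 um deep and the fiber axis sits roughly 2 um above the wafer surface.

```python
import math

# Angle between the {111} sidewalls and the (100) wafer surface, about 54.74 degrees.
SIDEWALL_ANGLE = math.atan(math.sqrt(2.0))

def groove_depth(width: float) -> float:
    """Depth of a fully etched V-groove with the given top width."""
    return 0.5 * width * math.tan(SIDEWALL_ANGLE)

def fiber_center_height(width: float, fiber_diameter: float) -> float:
    """Height of the fiber axis above the wafer surface (negative means below it)."""
    radius = 0.5 * fiber_diameter
    # A circle resting on both sidewalls has its center radius / cos(angle) above the apex.
    center_above_apex = radius / math.cos(SIDEWALL_ANGLE)
    return center_above_apex - groove_depth(width)

width = 150e-6     # groove width from the text
cladding = 125e-6  # assumed cladding diameter, not given in the text
print(f"groove depth: {groove_depth(width) * 1e6:.1f} um")
print(f"fiber axis above surface: {fiber_center_height(width, cladding) * 1e6:.1f} um")
```

In this geometry the axis height changes by about 0.7 um for every 1 um change in groove width, so control of the groove width affects vertical alignment as well as pitch.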


5.3 Forming the end-faces of the fiber change- 
over 

The fiber array is secured with resin to a 
glass transfer substrate, and the fiber array is cut 
with a dicing saw. At this stage, it is important 
to form smooth end-faces with a narrow cut 
spacing. 

A dicing saw with a 35-um diamond blade
is used to cut the fiber array into two parts. 
A hard adhesive resin is used to secure the 
fibers to the V-shaped grooves, and the resin 
is completely cured before cutting. If the 
resin is not sufficiently cured, the resin may 
cling to the cut end-faces, resulting in reduced 
optical coupling. 

Figure 18 shows fiber end-faces after cutting.

Fig. 18—Fiber end-faces after cutting.

For samples with a 40-um spacing between fiber
end-faces, the optical coupling losses ranged 
from 0.5 dB to 0.8 dB. We cut the synthetic 
quartz glass of these samples and inspected the 
roughness of the end-faces. Figure 19 shows that 
the roughness is within 0.1 um. This value
suggests a low scattering loss because it is much
smaller than the 1.3-um optical wavelength.


5.4 Transferring the fiber arrays to the holder

The actuator is then assembled in the holder 
and adjusted to the initial position. The cut 
fiber array on the transfer substrate is secured 
to the actuator and the stationary table. Then, 
the array is attached to the holder so that the 
cut section is between the fixed and moving 
sections. To do this, we use a jig that supports 
the array, provides fine holder adjustment, and 
supports the holder. The planes of the supports 
on which the array and holder are retained 
are parallel. The position of the holder is finely
adjusted in the X, Y, and Z directions and
in the direction of horizontal rotation. When 
the holder is correctly positioned, resin is 
applied between the holder and the array to 
fasten them together. The holder positioning
is then checked visually, and the resin is allowed
to harden.

When the resin has completely hardened, the 
transfer substrate is immersed in water to 
enable its removal. 

The resin used to fasten the array to the 
holder has a volumetric shrinkage factor as low 
as 2 or 3 percent when fully cured. This resin 
can absorb a relative displacement of up to 
about 50 um in the Z direction between the 
array mounting positions on the two sections.
This absorption keeps the relative displacement
between the transferred arrays to within 2 um.
The increase in insertion loss due to the transfer
is from 0.05 dB to 0.1 dB, and is mainly caused
by optical axis displacement and bending along
the Z axis.

Fig. 19—Roughness of quartz glass cutting plane.
(Horizontal axis: Horizontal spacing (um), 0 to 400.)

Table 3. Characteristics of the optical switch

Item                          Specification
Optical characteristics
  Insertion loss              0.8 dB
  Cross talk                  < -50 dB
  Switching speed             10 ms
Electrical characteristics
  Input voltage               5 V
  Input current               100 mA
Case dimensions               55 x 30 x 6 mm

Fig. 20—2 x 2 optical switch. (Scale bar: 20 mm.)


6. Optical switch evaluation 

Table 3 shows the basic characteristics of a
2 x 2 moving-fiber optical switch using multimode
fiber.

The insertion loss is within 0.8 dB and the 
switching time is within 10 ms. The insertion 
loss is greater than the estimated value of 
0.4 dB. This discrepancy is caused by resin
contamination of the fiber changeover end-faces 
during their manufacture, and by optical axis 
displacement in the Z direction and bending 
when the array is transferred. The increase 
in insertion loss caused by forming the fiber 
changeover end-faces is from 0.05 dB to 
0.4 dB, and the increase caused by transfer is 
from 0.05 dB to 0.1 dB. The switching speed is 
affected by chatter caused by second harmonic
vibration of the LiNbO3 bimorph cell. The
LiNbO3 bimorph actuator provides faster
switching and lower loss than conventional
optical switches which use solenoids to move 
prisms or mirrors. This improvement makes 
it possible to downsize devices. Figure 20 shows 
a completed 2 x 2 optical switch. 
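
The loss figures quoted in this section can be combined into a simple budget. The sketch below assumes that the individual contributions add directly in dB, which is a simplification introduced here rather than a method stated in the text; the variable and function names are illustrative. Adding the two quoted increase ranges to the 0.4 dB estimate gives 0.5 dB to 0.9 dB, which brackets the measured insertion loss of up to 0.8 dB.

```python
ESTIMATED_LOSS_DB = 0.4             # estimated value quoted in this section
END_FACE_INCREASE_DB = (0.05, 0.4)  # increase from forming the changeover end-faces
TRANSFER_INCREASE_DB = (0.05, 0.1)  # increase from transferring the array

def budget_range(base: float, *increments: tuple) -> tuple:
    """Return (low, high) total loss, assuming dB contributions simply add."""
    low = base + sum(lo for lo, _ in increments)
    high = base + sum(hi for _, hi in increments)
    return low, high

def transmitted_fraction(loss_db: float) -> float:
    """Fraction of optical power transmitted for a given loss in dB."""
    return 10.0 ** (-loss_db / 10.0)

low, high = budget_range(ESTIMATED_LOSS_DB, END_FACE_INCREASE_DB, TRANSFER_INCREASE_DB)
print(f"budget: {low:.2f} dB to {high:.2f} dB")
print(f"transmission at 0.8 dB: {transmitted_fraction(0.8) * 100:.1f} %")
```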


7. Conclusion

A bending displacement actuator free of
hysteresis and creep was developed using
a LiNbO3 plate having an inversion layer.
Placing stoppers on both sides of the actuator
and a load on the LiNbO3 plate damps vibration
so that the actuator can switch within
10 ms. A new construction technique was
developed to simplify optical axis alignment
and to obviate the need for fiber end-face polishing.
The technique consists of the following
three steps:

1) A highly precise fiber array is constructed by
arranging fibers in V-shaped grooves anisotropically
etched in silicon. The arrangement
precision is within 2 um.

2) The fiber array is precision-cut to produce
changeover sections having a cut surface
roughness within 0.1 um.

3) The cut fiber array is precisely mounted on
a holder to form the optical circuit.

The only adjustment required is the trimming
of the stoppers to obtain the correct fiber
displacement. A 2 x 2 moving-fiber optical
switch constructed using the above technique
features an insertion loss that is within 0.8 dB.

Such a switch switches faster and has fewer 
components than ordinary optical switches that 
use solenoids to move prisms or mirrors. This 
improvement enables device miniaturization. 
The construction technique can also be applied 
to microoptical systems.
